NORMAL SUBGROUPS IN THE CREMONA GROUP



SERGE CANTAT AND STÉPHANE LAMY



Abstract. Let k be an algebraically closed field. We show that the Cremona
group of all birational transformations of the projective plane $\mathbb{P}^2_k$ is not a
simple group. The strategy makes use of hyperbolic geometry, geometric
group theory, and algebraic geometry to produce elements in the Cremona
group which generate non trivial normal subgroups.

Résumé. Let k be an algebraically closed field. We show that the Cremona
group, formed by the birational transformations of the projective plane
$\mathbb{P}^2_k$, is not a simple group. To this end we produce elements generating
non trivial normal subgroups by means of geometric group theory,
hyperbolic geometry, and algebraic geometry.
1. Introduction

1.1. The Cremona group. The Cremona group in n variables over a field k
is the group $\mathrm{Bir}(\mathbb{P}^n_k)$ of birational transformations of the projective space
$\mathbb{P}^n_k$ or, equivalently, the group of k-automorphisms of the field
$k(x_1, \dots, x_n)$ of rational functions of $\mathbb{P}^n_k$. In dimension n = 1, the Cremona
group coincides with the group $\mathrm{Aut}(\mathbb{P}^1_k)$ of regular automorphisms of $\mathbb{P}^1_k$,
that is with the group $\mathrm{PGL}_2(k)$ of linear projective transformations. When
$n \geq 2$, the group $\mathrm{Bir}(\mathbb{P}^n_k)$ is infinite dimensional; it contains algebraic
groups of arbitrarily large dimensions (see [2, 39]). In this article, we prove
the following theorem.

Main Theorem. If k is an algebraically closed field, the Cremona group
$\mathrm{Bir}(\mathbb{P}^2_k)$ is not a simple group.

This answers one of the main open problems concerning the algebraic structure
of this group: According to Dolgachev, Manin asked in the sixties whether
$\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$ is a simple group; Mumford also mentioned this question in the
early seventies (see [36]). The proof of this theorem makes use of geometric
group theory and isometric actions on infinite dimensional hyperbolic spaces.
It provides stronger results when one works over the field of complex numbers
$\mathbf{C}$; before stating these results, a few remarks are in order.

Remark 1.1.a. There is a natural topology on the Cremona group which
induces the Zariski topology on the space $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbf{C}})$ of birational
transformations of degree d (see [39, 2] and references therein). J. Blanc
proved in [3] that $\mathrm{Bir}(\mathbb{P}^2_k)$ has only two closed normal subgroups for this
topology, namely {Id} and $\mathrm{Bir}(\mathbb{P}^2_k)$ itself. J. Déserti proved that
$\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$ is perfect, hopfian, and co-hopfian, and that its automorphism
group is generated by inner automorphisms and the action of automorphisms of
the field of complex numbers (see [15, 14]). In particular, there is no obvious
algebraic reason which explains why $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$ is not simple.

Remark 1.1.b. An interesting example is given by the group $\mathrm{Aut}[\mathbb{A}^2_k]_1$ of
polynomial automorphisms of the affine plane $\mathbb{A}^2_k$ with Jacobian determinant
equal to 1. From Jung's theorem, one knows that this group is the amalgamated
product of the group of affine transformations $\mathrm{SL}_2(k) \ltimes k^2$ with the group
of elementary transformations

$f(x,y) = (ax + p(y), y/a)$, $a \in k^*$, $p(y) \in k[y]$,

over their intersection (see [33] and references therein); as such,
$\mathrm{Aut}[\mathbb{A}^2_k]_1$ acts faithfully on a simplicial tree without fixing any edge or
vertex. Danilov used this action, together with small cancellation theory, to
prove that $\mathrm{Aut}[\mathbb{A}^2_k]_1$ is not simple (see [12] and [24]).

Remark 1.1.c. The group $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$ contains a linear group with Kazhdan
property (T), namely $\mathrm{PGL}_3(\mathbf{C})$, and it cannot be written as a non trivial
free product with amalgamation. Thus, even if our strategy is similar to
Danilov's proof, it has to be more involved (see §6.1.2).

1.2. General elements. Let [x : y : z] be homogeneous coordinates for the
projective plane $\mathbb{P}^2_k$. Let f be a birational transformation of $\mathbb{P}^2_k$. There
are three homogeneous polynomials P, Q, and $R \in k[x, y, z]$ of the same degree
d, and without common factor of degree $\geq 1$, such that

$f[x : y : z] = [P : Q : R]$.

The degree d is called the degree of f, and is denoted by deg(f). The space
$\mathrm{Bir}_d(\mathbb{P}^2_k)$ of birational transformations of degree d is a quasi-projective
variety. For instance, Cremona transformations of degree 1 correspond to the
automorphism group $\mathrm{Aut}(\mathbb{P}^2_k) = \mathrm{PGL}_3(k)$ and Cremona transformations of
degree 2 form an irreducible variety of dimension 14 (see [8]).
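As a concrete illustration of a degree 2 element, one can take the standard quadratic involution $\sigma[x : y : z] = [yz : zx : xy]$. This classical example is not named in the text above and is used here only as an assumption-free sanity check: its three components have degree 2 and no common factor, and $\sigma \circ \sigma$ multiplies every coordinate by $xyz$, so it is the identity as a birational map. A minimal sketch with exact rational arithmetic:

```python
from fractions import Fraction

def sigma(p):
    """Standard quadratic involution [x:y:z] -> [yz:zx:xy] (P=yz, Q=zx, R=xy)."""
    x, y, z = p
    return (y * z, z * x, x * y)

def same_projective_point(p, q):
    """True when p and q are proportional, i.e. all 2x2 minors vanish."""
    return all(p[i] * q[j] == p[j] * q[i]
               for i in range(3) for j in range(i + 1, 3))

# A point outside the coordinate triangle xyz = 0, where sigma is either
# undefined (the three vertices) or contracts a line (the three sides).
p = (Fraction(2), Fraction(-3), Fraction(5))
q = sigma(sigma(p))            # equals (xyz) * p, hence the same point of P^2
assert same_projective_point(q, p)
```

The check is done at a single point; the identity $\sigma(\sigma(p)) = xyz \cdot p$ holds for all coordinates, which is why the common factor $xyz$ must be removed before reading off the degree of a composition.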

We define the de Jonquières group J as the group of birational transformations
of the plane $\mathbb{P}^2_k$ which preserve the pencil of lines through the point
[1 : 0 : 0]. Let $J_d$ be the subset of J made of birational transformations of
degree d, and let $V_d$ be the subset of $\mathrm{Bir}_d(\mathbb{P}^2_k)$ whose elements are
compositions $h_1 \circ f \circ h_2$ where $h_1$ and $h_2$ are in $\mathrm{Aut}(\mathbb{P}^2_k)$ and f is in
$J_d$. The dimension of $\mathrm{Bir}_d(\mathbb{P}^2_k)$ is equal to 4d + 6 and $V_d$ is its unique
irreducible component of maximal dimension (see [37]).

On an algebraic variety W, a property is said to be generic if it is satisfied
on the complement of a Zariski closed subset of W of codimension $\geq 1$, and is
said to be general if it is satisfied on the complement of countably many
Zariski closed subsets of W of codimension $\geq 1$.



Theorem A. There exists a positive integer k with the following property. If g
is a general element of $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbf{C}})$, and n is an integer with $n \geq k$,
then $g^n$ generates a normal subgroup of the Cremona group $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$ that
does not contain any element f of $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}}) \setminus \{\mathrm{Id}\}$ with
$\deg(f) < \deg(g)^n$.

To prove Theorem A we choose g in the unique irreducible component of maximal
dimension $V_d$. Thus, the proof does not provide any information concerning
general elements of the remaining components of the variety $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbf{C}})$.
As a corollary of Theorem A, the group $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$ contains an uncountable
number of normal subgroups (see §6.3).

Remark 1.2. We provide an explicit upper bound for the best possible constant
k, namely $k \leq 86611$. We do so to help the reader follow the proof. Our
method gives a rather small constant for large enough degrees, namely
$k \leq 375$, but does not answer the following open question: Is the best
constant k equal to 1? In other words, does a general element of
$\mathrm{Bir}_d(\mathbb{P}^2_{\mathbf{C}})$ generate a proper normal subgroup of $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$ for all
degrees $d \geq 2$? (Note that $g^n$ is not a general element of
$\mathrm{Bir}_{d^n}(\mathbb{P}^2_{\mathbf{C}})$ if $n \geq 2$ and $g \in \mathrm{Bir}_d(\mathbb{P}^2_{\mathbf{C}})$.)

1.3. Automorphisms of Kummer and Coble surfaces. If X is a rational
surface, then the group of automorphisms Aut(X) is conjugate to a subgroup of
$\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$. In §5.2, we study two classes of examples. The first one is a
(generalized) Kummer surface. Let $\mathbf{Z}[i] \subset \mathbf{C}$ be the lattice of Gaussian
integers. We start with the abelian surface $Y = E \times E$ where E is the
elliptic curve $\mathbf{C}/\mathbf{Z}[i]$. The group $\mathrm{SL}_2(\mathbf{Z})$ acts by linear automorphisms on
Y, and commutes with the order 4 homothety $\eta(x,y) = (ix, iy)$. The quotient
$Y/\eta$ is a (singular) rational surface on which $\mathrm{PSL}_2(\mathbf{Z})$ acts by
automorphisms. Since $Y/\eta$ is rational, the conjugacy by any birational map
$\phi: Y/\eta \dashrightarrow \mathbb{P}^2_{\mathbf{C}}$ provides an isomorphism between $\mathrm{Bir}(Y/\eta)$ and
$\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$, and therefore an embedding of $\mathrm{PSL}_2(\mathbf{Z})$ into $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}})$.

Theorem B. There is an integer $k \geq 1$ with the following property. Let M be
an element of $\mathrm{SL}_2(\mathbf{Z})$ with trace $|\mathrm{tr}(M)| \geq 3$. Let $g_M$ be the
automorphism of the rational Kummer surface $Y/\eta$ which is induced by M. Then,
$g_M^k$ generates a non trivial normal subgroup of the Cremona group
$\mathrm{Bir}(\mathbb{P}^2_{\mathbf{C}}) \simeq \mathrm{Bir}(Y/\eta)$.

Theorems A and B provide examples of normal subgroups coming respectively
from general and from highly non generic elements in the Cremona group. In
§5.2.3, we also describe automorphisms of Coble surfaces that generate non
trivial normal subgroups of the Cremona group; Coble surfaces are quotients of
K3 surfaces, while Kummer surfaces are quotients of abelian surfaces. Thanks to
a result due to Coble and Dolgachev, for any algebraically closed field k we
obtain automorphisms of Coble surfaces which generate proper normal subgroups
in the Cremona group $\mathrm{Bir}(\mathbb{P}^2_k)$. The Main Theorem follows from this
construction (see Theorem 5.20).

1.4. An infinite dimensional hyperbolic space (see §4). The Cremona
group $\mathrm{Bir}(\mathbb{P}^2_k)$ acts faithfully on an infinite dimensional hyperbolic space
which is the exact analogue of the classical finite dimensional hyperbolic
spaces $\mathbb{H}^n$. This action is at the heart of the proofs of Theorems A and B. To
describe it, let us consider all rational surfaces $\pi: X \to \mathbb{P}^2_k$ obtained
from the projective plane by successive blow-ups. If $\pi': X' \to \mathbb{P}^2_k$ is
obtained from $\pi: X \to \mathbb{P}^2_k$ by blowing up more points, then there is a
natural birational morphism $\varphi: X' \to X$, defined by
$\varphi = \pi^{-1} \circ \pi'$, and the pullback operator $\varphi^*$ embeds the Néron-Severi
group $N^1(X) \otimes \mathbf{R}$ into $N^1(X') \otimes \mathbf{R}$ (note that $N^1(X) \otimes \mathbf{R}$ can be
identified with the second cohomology group $H^2(X, \mathbf{R})$ when $k = \mathbf{C}$). The
direct limit of all these groups $N^1(X) \otimes \mathbf{R}$ is called the Picard-Manin
space of $\mathbb{P}^2_k$. This infinite dimensional vector space comes together with an
intersection form of signature $(1, \infty)$, induced by the intersection form on
divisors; we shall denote it by

$([\alpha], [\beta]) \mapsto [\alpha] \cdot [\beta]$.

We obtain in this way an infinite dimensional Minkowski space. The set of
elements $[\alpha]$ in this space with self intersection $[\alpha] \cdot [\alpha] = 1$ is a
hyperboloid with two sheets, one of them containing classes of ample divisors
of rational surfaces; this connected component is an infinite dimensional
hyperbolic space for the distance dist defined in terms of the intersection
form by

$\cosh(\mathrm{dist}([\alpha], [\beta])) = [\alpha] \cdot [\beta]$.

Taking the completion of this metric space, we get a complete hyperbolic space
$\mathbb{H}_{\bar{\mathcal{Z}}}$. The Cremona group acts faithfully on the Picard-Manin space, preserves
the intersection form, and acts by isometries on the hyperbolic space $\mathbb{H}_{\bar{\mathcal{Z}}}$.
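The distance formula can be tried out in the finite dimensional model, which the text describes as the exact analogue of the infinite dimensional space. The sketch below is only an illustration under that assumption: it uses a form of signature (1, 2) on triples of real numbers, and the helper names are hypothetical.

```python
import math

def minkowski(u, v):
    """Form of signature (1, n): u0*v0 - u1*v1 - ... - un*vn."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

def hyperbolic_distance(u, v):
    """Distance on the sheet {u.u = 1, u0 > 0}, from cosh(dist) = u.v."""
    return math.acosh(max(1.0, minkowski(u, v)))   # clamp rounding below 1

def point_on_geodesic(t):
    """Unit speed geodesic t -> (cosh t, sinh t, 0) on the hyperboloid."""
    return (math.cosh(t), math.sinh(t), 0.0)

u, v = point_on_geodesic(0.3), point_on_geodesic(1.5)
separation = hyperbolic_distance(u, v)   # close to 1.5 - 0.3
```

Along this geodesic the form gives $\cosh(t_1)\cosh(t_2) - \sinh(t_1)\sinh(t_2) = \cosh(t_1 - t_2)$, so the computed distance is the parameter difference, as expected for an arc length parametrization.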

1.5. Normal subgroups in isometry groups. In Part A, we study the general
situation of a group G acting by isometries on a $\delta$-hyperbolic space $\mathcal{H}$. Let
us explain the content of the central result of Part A, namely Theorem 2.9, in
the particular case of the Cremona group $\mathrm{Bir}(\mathbb{P}^2_k)$ acting by isometries on
the hyperbolic space $\mathbb{H}_{\bar{\mathcal{Z}}}$. Isometries of hyperbolic spaces fall into three
types: elliptic, parabolic, and hyperbolic. A Cremona transformation
$g \in \mathrm{Bir}(\mathbb{P}^2_k)$ determines a hyperbolic isometry $g_*$ of $\mathbb{H}_{\bar{\mathcal{Z}}}$ if and only if
the following equivalent properties are satisfied:

• The sequence of degrees $\deg(g^n)$ grows exponentially fast:
  $\lambda(g) := \limsup_{n \to \infty} (\deg(g^n))^{1/n} > 1$;

• There is a $g_*$-invariant plane $V_g$ in the Picard-Manin space that intersects
  $\mathbb{H}_{\bar{\mathcal{Z}}}$ on a curve $\mathrm{Ax}(g_*)$ (a geodesic line) on which $g_*$ acts by a
  translation; more precisely, $\mathrm{dist}(x, g_*(x)) = \log(\lambda(g))$ for all x in
  $\mathrm{Ax}(g_*)$.
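The growth condition in the first item can be observed on a small example. The sketch below uses the Hénon map $(x, y) \mapsto (y, x + y^2)$, which is not discussed in the text: it is chosen here as an assumption because it is a polynomial automorphism of the affine plane, and for such a map the degree as a birational transformation of the plane equals the polynomial degree (homogenizing introduces no common factor). Polynomials are stored as dictionaries mapping exponent pairs to coefficients; all helper names are hypothetical.

```python
def mul(p, q):
    """Product of two polynomials stored as {(i, j): coeff}."""
    out = {}
    for (a, b), c in p.items():
        for (e, f), d in q.items():
            key = (a + e, b + f)
            out[key] = out.get(key, 0) + c * d
    return {k: v for k, v in out.items() if v != 0}

def add(p, q):
    """Sum of two polynomials, dropping cancelled monomials."""
    out = dict(p)
    for k, v in q.items():
        out[k] = out.get(k, 0) + v
    return {k: v for k, v in out.items() if v != 0}

def substitute(p, fx, fy):
    """Return p(fx, fy), i.e. replace x by fx and y by fy."""
    result = {}
    for (i, j), c in p.items():
        term = {(0, 0): c}
        for _ in range(i):
            term = mul(term, fx)
        for _ in range(j):
            term = mul(term, fy)
        result = add(result, term)
    return result

def map_degree(f):
    """Degree of a polynomial map: largest total degree of its components."""
    return max(i + j for comp in f for (i, j) in comp)

def iterate(f, n):
    """n-fold composition f o f o ... o f, for n >= 1."""
    g = f
    for _ in range(n - 1):
        g = (substitute(f[0], *g), substitute(f[1], *g))
    return g

X, Y = {(1, 0): 1}, {(0, 1): 1}
henon = (Y, add(X, mul(Y, Y)))                    # (x, y) -> (y, x + y^2)

degrees = [map_degree(iterate(henon, n)) for n in range(1, 6)]
growth = [d ** (1.0 / n) for n, d in enumerate(degrees, start=1)]
```

For this map the degrees double at each step, so the quantity $(\deg(g^n))^{1/n}$ is constant equal to 2 and $\lambda(g) = 2 > 1$; by the equivalence stated above, such a map would induce a hyperbolic isometry with translation length $\log 2$.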

The curve $\mathrm{Ax}(g_*)$ is uniquely determined and is called the axis of $g_*$. We
shall say that an element g of the Cremona group is tight if it satisfies the
following three properties:

• The isometry $g_*: \mathbb{H}_{\bar{\mathcal{Z}}} \to \mathbb{H}_{\bar{\mathcal{Z}}}$ is hyperbolic;

• There exists a positive number B such that: If f is an element of
  $\mathrm{Bir}(\mathbb{P}^2_k)$ and $f_*(\mathrm{Ax}(g_*))$ contains two points at distance B which are at
  distance at most 1 from $\mathrm{Ax}(g_*)$, then $f_*(\mathrm{Ax}(g_*)) = \mathrm{Ax}(g_*)$;

• If f is in $\mathrm{Bir}(\mathbb{P}^2_k)$ and $f_*(\mathrm{Ax}(g_*)) = \mathrm{Ax}(g_*)$, then $f g f^{-1} = g$ or
  $g^{-1}$.

The second property is a rigidity property of $\mathrm{Ax}(g_*)$ with respect to
isometries $f_*$ with f in $\mathrm{Bir}(\mathbb{P}^2_k)$. The third property means that the
stabilizer of $\mathrm{Ax}(g_*)$ coincides with the normalizer of the cyclic group
$\langle g \rangle$. Applied to the Cremona group, Theorem 2.9 gives the following
statement.

Theorem C. Let g be an element of the Cremona group $\mathrm{Bir}(\mathbb{P}^2_k)$. If the
isometry $g_*: \mathbb{H}_{\bar{\mathcal{Z}}} \to \mathbb{H}_{\bar{\mathcal{Z}}}$ is tight, there exists a positive integer k such
that $g^k$ generates a non trivial normal subgroup of $\mathrm{Bir}(\mathbb{P}^2_k)$.

Theorems A and B can now be rephrased by saying that general elements
of $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbf{C}})$ (resp. automorphisms $g_M$ on rational Kummer surfaces) are
tight elements of the Cremona group (see Theorems 5.14 and 5.20).

1.6. Description of the paper. The paper starts with the proof of Theorem C
in the general context of $\delta$-hyperbolic spaces (section 2), and explains how
this general statement can be used in the case of isometry groups of spaces
with constant negative curvature (section 3). Section 4 provides an overview
of the Picard-Manin space, the associated hyperbolic space $\mathbb{H}_{\bar{\mathcal{Z}}}$, and the
isometric action of $\mathrm{Bir}(\mathbb{P}^2_k)$ on this space. Algebraic geometry and geometric
group theory are then put together to prove Theorem A (§5.1), Theorem B, and
the Main Theorem (§5.2). At the end of the paper we list some remarks and
comments.

1.7. Acknowledgment. Thanks to Thomas Delzant, Koji Fujiwara, and Vincent
Guirardel for interesting discussions regarding hyperbolic groups and geometric
group theory, and to Jérémy Blanc, Dominique Cerveau, Julie Déserti,
Igor Dolgachev, and Jean-Pierre Serre for nice discussions on the Cremona group
and Coble surfaces.

Part A. Hyperbolic Geometry and Normal Subgroups 

2. GROMOV HYPERBOLIC SPACES 

This section is devoted to the proof of Theorem 2.9, a result similar to Theo- 
rem C but in the general context of isometries of Gromov hyperbolic spaces. 

2.1. Basic definitions. 

2.1.1. Hyperbolicity. Let $(\mathcal{H}, d)$ be a metric space. If x is a base point of
$\mathcal{H}$, the Gromov product of two points $y, z \in \mathcal{H}$ is

$(y|z)_x = \frac{1}{2}\{d(y,x) + d(z,x) - d(y,z)\}$.

The triangle inequality implies that $(y|z)_x$ is a non negative real number.
Let $\delta$ be a non negative real number. The metric space $(\mathcal{H}, d)$ is
$\delta$-hyperbolic in the sense of Gromov if

$(x|z)_w \geq \min\{(x|y)_w, (y|z)_w\} - \delta$

for all x, y, w, and z in $\mathcal{H}$. Equivalently (multiply the previous inequality
by 2 and expand the Gromov products), the space $\mathcal{H}$ is $\delta$-hyperbolic if, for
any $w, x, y, z \in \mathcal{H}$, we have

$d(w,x) + d(y,z) \leq \max\{d(w,y) + d(x,z), d(w,z) + d(x,y)\} + 2\delta$. (2.1)
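Both formulations can be checked by brute force on a small metric tree, the basic example of a 0-hyperbolic space. The sketch below is an illustration only: the tree, its edge lengths, and the function names are hypothetical, and both inequalities are tested with $\delta = 0$, where the additive constant plays no role.

```python
from fractions import Fraction
from itertools import product

# A small metric tree given by its edges and their lengths (hypothetical data).
edges = {("r", "a"): 2, ("r", "b"): 3, ("a", "c"): 1, ("a", "d"): 4, ("b", "e"): 2}

def tree_distances(edges):
    """All pairwise distances in a weighted tree, by depth-first search."""
    adj = {}
    for (u, v), w in edges.items():
        adj.setdefault(u, []).append((v, w))
        adj.setdefault(v, []).append((u, w))
    dist = {}
    for src in adj:
        dist[src, src] = 0
        stack = [(src, None)]
        while stack:
            node, parent = stack.pop()
            for nxt, w in adj[node]:
                if nxt != parent:
                    dist[src, nxt] = dist[src, node] + w
                    stack.append((nxt, node))
    return dist, list(adj)

d, points = tree_distances(edges)

def gromov(y, z, x):
    """Gromov product (y|z)_x = (d(y,x) + d(z,x) - d(y,z)) / 2."""
    return Fraction(d[y, x] + d[z, x] - d[y, z], 2)

def gromov_condition(delta):
    """(x|z)_w >= min{(x|y)_w, (y|z)_w} - delta for all quadruples."""
    return all(gromov(x, z, w) >= min(gromov(x, y, w), gromov(y, z, w)) - delta
               for w, x, y, z in product(points, repeat=4))

def four_point_condition_tree():
    """Four-point inequality with delta = 0 for all quadruples."""
    return all(d[w, x] + d[y, z] <= max(d[w, y] + d[x, z], d[w, z] + d[x, y])
               for w, x, y, z in product(points, repeat=4))
```

In a tree the Gromov product $(y|z)_x$ is the length of the common part of the segments from x to y and from x to z; for instance `gromov("c", "d", "r")` is the distance from `r` to the branch point `a`.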




Two fundamental examples of $\delta$-hyperbolic spaces are given by simplicial
metric trees, or more generally by R-trees, since they are 0-hyperbolic, and by
the n-dimensional hyperbolic space $\mathbb{H}^n$, or more generally CAT(-1) spaces,
since both are $\delta$-hyperbolic for $\delta = \log(3)$ (see [25, chapter 2], [9]).

2.1.2. Geodesics. A geodesic segment from x to y in $\mathcal{H}$ is an isometry $\gamma$
from an interval $[s, t] \subset \mathbf{R}$ into $\mathcal{H}$ such that $\gamma(s) = x$ and
$\gamma(t) = y$. The metric space $(\mathcal{H}, d)$ is a geodesic space if any pair of
points in $\mathcal{H}$ can be joined by a geodesic segment. Let $[x,y] \subset \mathcal{H}$ be a
geodesic segment, and let $\gamma: [0, l] \to \mathcal{H}$ be an arc length parametrization
of [x,y], with $\gamma(0) = x$ and $\gamma(l) = y$. By convention, for
$0 \leq s \leq l - t \leq l$, we denote by $[x + s, y - t]$ the geodesic segment
$\gamma([s, l - t])$.

A geodesic line is an isometry $\gamma$ from $\mathbf{R}$ to its image $\Gamma \subset \mathcal{H}$. If x
is a point of $\mathcal{H}$ and $\Gamma$ is (the image of) a geodesic line, a projection of
x onto $\Gamma$ is a point $z \in \Gamma$ which realizes the distance from x to $\Gamma$.
Projections exist but are not necessarily unique (in a tree, or in $\mathbb{H}^n$, the
projection is unique). If $\mathcal{H}$ is $\delta$-hyperbolic and if z and z' are two
projections of a point x on a geodesic $\Gamma$, the distance between z and z' is at
most $2\delta$. In other words, the projection is unique up to an error of
magnitude $2\delta$. In what follows, we shall denote by $\pi_\Gamma(x)$ any projection of
x on $\Gamma$.

2.2. Classical results from hyperbolic geometry. In what follows, $\mathcal{H}$ will
be a $\delta$-hyperbolic and geodesic metric space.

2.2.1. Approximation by trees. In a $\delta$-hyperbolic space, the geometry of
finite subsets is well approximated by finite subsets of metric trees; we refer
to [25] chapter 2, or [9], for a proof of this basic and crucial fact.

Lemma 2.1 (Approximation by trees). Let $\mathcal{H}$ be a $\delta$-hyperbolic space, and
$(x_0, x_1, \dots, x_n)$ a finite list of points in $\mathcal{H}$. Let X be the union of the
n geodesic segments $[x_0, x_i]$ from the base point $x_0$ to each other $x_i$,
$1 \leq i \leq n$. Then there exist a metric tree T and a map $\Phi: X \to T$ such
that:

(1) $\Phi$ is an isometry from $[x_0, x_i]$ to $[\Phi(x_0), \Phi(x_i)]$ for all
    $1 \leq i \leq n$;

(2) for all $x, y \in X$

    $d(x,y) - 2\delta \log_2(n) \leq d(\Phi(x), \Phi(y)) \leq d(x,y)$.

The map $\Phi: X \to T$ is called an approximation tree. For the sake of
simplicity, the distance in the tree is also denoted $d(\cdot,\cdot)$. However, to
avoid any confusion, we stick to the convention that a point in the tree will
always be written under the form $\Phi(x)$, with $x \in X$.
Convention. When $n \leq 4$ then $2\log_2(n) \leq 4$; choosing $\theta$ in $\mathbf{R}_+$ such
that $\theta = 4\delta$, we have

$d(x,y) - \theta \leq d(\Phi(x), \Phi(y)) \leq d(x,y)$. (2.2)

From now on we fix such a $\theta$, and we always use the approximation lemma in
this way for at most 5 points $(x_0, \dots, x_4)$. The first point in the list will
always be taken as the base point and will be denoted by a white circle in the
pictures. Since two segments with the same extremities are $\delta$-close, the
specific choice of the segments between $x_0$ and the $x_i$ is not important. We
may therefore forget which segments are chosen, and refer to an approximation
tree as a pair $(\Phi, T)$ associated to $(x_0, \dots, x_n)$. In general, the choice
of the segments between $x_0$ and the $x_i$ is either clear, or irrelevant.

In the remainder of §2.2, well known facts from hyperbolic geometry are listed;
complete proofs can be found in [25], [9], or in the long version of this
paper [7]. First, the following corollary is an immediate consequence of the
approximation lemma, and Lemma 2.3 follows from the corollary.

Corollary 2.2. Let $\Phi: X \to T$ be an approximation tree for at most 5 points.
Then

$(\Phi(x)|\Phi(y))_{\Phi(z)} - \frac{\theta}{2} \leq (x|y)_z \leq (\Phi(x)|\Phi(y))_{\Phi(z)} + \theta$

for all x, y, and z in X. In particular $(x|y)_z \leq \theta$ as soon as
$\Phi(z) \in [\Phi(x), \Phi(y)]$.

Lemma 2.3 (Obtuse angle implies thinness). Let $\Gamma \subset \mathcal{H}$ be a geodesic line.
Let x be a point of $\mathcal{H}$ and $a \in \Gamma$ be a projection of x onto $\Gamma$. For all b
in $\Gamma$ and c in the segment $[a,b] \subset \Gamma$, we have $(x|b)_c \leq 2\theta$.

2.2.2. Shortening and weak convexity. The following lemma says that two
geodesic segments are close as soon as their extremities are not too far.
Lemma 3.1 makes this statement much more precise in the context of CAT(-1)
spaces.

Lemma 2.4 (A little shorter, much closer. See 1.3.3 in [13]). Let [x,y] and
[x',y'] be two geodesic segments of $\mathcal{H}$ such that

(i) $d(x,x') \leq \beta$, and $d(y,y') \leq \beta$;

(ii) $d(x,y) \geq 2\beta + 4\theta$.

Then, the geodesic segment $[x' + \beta + \theta, y' - \beta - \theta]$ is in the
$2\theta$-neighborhood of [x,y].

The following lemma is obvious in a CAT(0) space (by convexity of the distance,
see [5, p. 176]).

Lemma 2.5 (Weak convexity). Let [x,y] and [x',y'] be two geodesic segments
of $\mathcal{H}$ such that d(x,x') and d(y,y') are bounded from above by $\beta$. Then
[x',y'] is in the $(\beta + 2\theta)$-neighborhood of [x,y].
2.2.3. Canoeing to infinity. The following lemma comes from [13] (see §1.3.4),
and says that hyperbolic canoes never make loops (see §2.3.13 in [31]). This
lemma is at the heart of our proof of Theorem C.

Lemma 2.6 (Canoeing). Let $y_0, \dots, y_n$ be a finite sequence of points in
$\mathcal{H}$, such that

(i) $d(y_i, y_{i-1}) \geq 10\theta$ for all $1 \leq i \leq n$;

(ii) $(y_{i+1}|y_{i-1})_{y_i} \leq 3\theta$ for all $1 \leq i \leq n - 1$.

Then, for all $1 \leq j \leq n$,

(1) $d(y_0, y_j) \geq d(y_0, y_{j-1}) + 2\theta$ if $j \geq 1$;

(2) $d(y_0, y_j) \geq \sum_{i=1}^{j} (d(y_i, y_{i-1}) - 7\theta)$;

(3) $y_j$ is $5\theta$-close to any geodesic segment $[y_0, y_n]$.

2.3. Rigidity of axis and non simplicity criterion. Let G be a group of
isometries of a $\delta$-hyperbolic space $(\mathcal{H}, d)$, and g be an element of G. Our
goal is to provide a criterion which implies that the normal subgroup
$\langle\langle g \rangle\rangle$ generated by g in G does not coincide with G.

2.3.1. Isometries. Let $f \in \mathrm{Isom}(\mathcal{H})$ be an isometry of a $\delta$-hyperbolic
space $(\mathcal{H}, d)$. The translation length L(f) of f is the infimum of d(x, f(x))
where x runs over all points of $\mathcal{H}$. We denote by Min(f) the set of points that
realize L(f). Following [5, page 229] the isometry f is called

(1) elliptic if f has a fixed point (in this case L(f) = 0 and Min(f) is the
    set of fixed points);

(2) hyperbolic if d(x, f(x)) attains a strictly positive minimum;

(3) parabolic if the infimum L(f) is not realized by any point in $\mathcal{H}$.

A hyperbolic isometry f has an invariant axis if there is a geodesic line
$\Gamma$ such that $f(\Gamma) = \Gamma$. In this case, $\Gamma$ is contained in Min(f) and f acts
as a translation of length L(f) along $\Gamma$. If $\Gamma$ and $\Gamma'$ are two invariant
axes, each of them is in the $2\theta$-tubular neighborhood of the other (apply
Lemma 2.4). When f has an invariant axis, we denote by Ax(f) any invariant
geodesic line, even if such a line is a priori not unique.

Remark 2.7. If $(\mathcal{H}, d)$ is CAT(0) and $\delta$-hyperbolic, every hyperbolic
isometry has an invariant axis (see [5, chap. II, theorem 6.8]). When $\mathcal{H}$ is a
tree or a hyperbolic space $\mathbb{H}^n$, the set Min(g) coincides with the unique
geodesic line it contains. If G is a hyperbolic group acting on its Cayley
graph, any hyperbolic $g \in G$ admits a periodic geodesic line; thus, there is a
non trivial power $g^k$ of g which has an invariant axis.
2.3.2. Rigidity of geodesic lines. If A and A' are two subsets of $\mathcal{H}$, the
intersection with precision $\alpha$ of A and A' is the intersection of the tubular
neighborhoods $\mathrm{Tub}_\alpha(A)$ and $\mathrm{Tub}_\alpha(A')$:

$A \cap_\alpha A' = \{x \in \mathcal{H};\ d(x, A) \leq \alpha \text{ and } d(x, A') \leq \alpha\}$.

Let $\varepsilon$ and B be positive real numbers. A subset A of $\mathcal{H}$ is
$(\varepsilon, B)$-rigid if f(A) = A as soon as $f \in G$ satisfies

$\mathrm{diam}(A \cap_\varepsilon f(A)) \geq B$.

This rigidity property involves both A and the group G. The set A is
$\varepsilon$-rigid if there exists a positive number B > 0 such that A is
$(\varepsilon, B)$-rigid. If $\varepsilon' < \varepsilon$ and $A \subset \mathcal{H}$ is $\varepsilon$-rigid, then A is also
$\varepsilon'$-rigid (for the same constant B). The converse holds for geodesic lines
(or convex sets) when $\varepsilon'$ is not too small:

Lemma 2.8. Let $\varepsilon > 2\theta$. If a geodesic line $\Gamma \subset \mathcal{H}$ is
$(2\theta, B)$-rigid, then $\Gamma$ is also $(\varepsilon, B + 6\varepsilon + 4\theta)$-rigid.

Proof. Let $B' = B + 6\varepsilon + 4\theta$. Suppose that
$\mathrm{diam}(\Gamma \cap_\varepsilon f(\Gamma)) \geq B'$. We want to show that $f(\Gamma) = \Gamma$. There exist
$x, y \in \Gamma$ and $x', y' \in f(\Gamma)$ such that $d(x,x') \leq 2\varepsilon$, $d(y,y') \leq 2\varepsilon$,
$d(x,y) \geq B' - 2\varepsilon$ and $d(x',y') \geq B' - 2\varepsilon$. By Lemma 2.4 (applied with
$\beta = 2\varepsilon$), the segment

$[u, v] := [x' + 2\varepsilon + \theta, y' - 2\varepsilon - \theta] \subset f(\Gamma)$

is $2\theta$-close to [x,y], and we have $d(u,v) \geq B$. Thus $\Gamma \cap_{2\theta} f(\Gamma)$
contains the two points u and v and has diameter at least B. Since $\Gamma$ is
$(2\theta, B)$-rigid, we conclude that $\Gamma = f(\Gamma)$. □

2.3.3. Tight elements and the normal subgroup theorem. We shall say that an
element $g \in G$ is tight if

• g is hyperbolic and admits an invariant axis $\mathrm{Ax}(g) \subset \mathrm{Min}(g)$;

• the geodesic line Ax(g) is $2\theta$-rigid;

• for all $f \in G$, if $f(\mathrm{Ax}(g)) = \mathrm{Ax}(g)$ then $f g f^{-1} = g$ or $g^{-1}$.

Note that if g is tight, then any iterate $g^n$, n > 0, is also tight. We now
state the main theorem of Part A.

Theorem 2.9 (Normal subgroup theorem). Let G be a group acting by isometries
on a hyperbolic space $\mathcal{H}$. Suppose that $g \in G$ is tight, with a
$(14\theta, B)$-rigid axis Ax(g). Let n be a positive integer with

$\frac{n L(g)}{20} \geq 60\theta + 2B$.

Then any element $h \neq \mathrm{Id}$ in the normal subgroup $\langle\langle g^n \rangle\rangle \subset G$
satisfies the following alternative: Either h is a conjugate of $g^n$ or h is a
hyperbolic isometry with translation length $L(h) > nL(g)$. In particular, if
$n \geq 2$, the normal subgroup $\langle\langle g^n \rangle\rangle$ does not contain g.
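The hypothesis on n is an explicit linear inequality, so the smallest admissible exponent can be computed directly from the translation length L(g), the constant $\theta$, and the rigidity constant B. The helper below is a hypothetical convenience, not part of the paper; it reads the condition as $nL(g)/20 \geq 60\theta + 2B$ and uses exact rational arithmetic to avoid rounding at the boundary.

```python
import math
from fractions import Fraction

def smallest_exponent(L, theta, B):
    """Smallest positive integer n with n * L / 20 >= 60 * theta + 2 * B."""
    L, theta, B = Fraction(L), Fraction(theta), Fraction(B)
    if L <= 0:
        raise ValueError("a hyperbolic isometry has translation length L > 0")
    bound = 20 * (60 * theta + 2 * B) / L     # n must be at least this value
    return max(1, math.ceil(bound))

n_min = smallest_exponent(L=100, theta=1, B=20)   # 20 * (60 + 40) / 100
```

The bound grows linearly in $\theta$ and B and is inversely proportional to L(g), which matches the intuition that long translations and strongly rigid axes need fewer iterations.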

Remark 2.10. Delzant in [13] gives a similar result when G is a hyperbolic group
and $\mathcal{H}$ the Cayley graph of G; our strategy of proof, presented in
paragraph 2.5, follows the same lines.

In the definition of a tight element, we could impose a weaker list of
hypotheses. The main point would be to replace Ax(g) by a long quasi-geodesic
segment (obtained, for example, by taking an orbit of a point x in Min(g));
this would give a similar statement, without assuming that g has an invariant
axis. Here, we take this slightly more restrictive definition because it turns
out to be sufficient for our purpose, and makes the proof less technical; we
refer to [13] or to the work in progress [11] for more general viewpoints.

We now prove Theorem 2.9. First we need to introduce the notion of an 
admissible presentation. 

2.4. Relators, neutral segments and admissible presentations. Let $g \in G$
be a hyperbolic isometry with an invariant axis; let Ax(g) be such an axis (see
Remark 2.7), and L = L(g) be the translation length of g. All isometries that
are conjugate to g have an invariant axis: If $f = sgs^{-1}$, then s(Ax(g)) is an
invariant axis for f.

2.4.1. Axis, relators, and neutral segments. Let $[x,y] \subset \mathcal{H}$ be an oriented
geodesic segment, the length of which is at least $20\theta$. The segment [x,y] is
a relator if there exists an element s in G such that [x,y] is contained in
the tubular neighborhood $\mathrm{Tub}_{7\theta}(s(\mathrm{Ax}(g)))$. If [x,y] is a relator, the
conjugates $f = sgs^{-1}$ of g and $f^{-1} = sg^{-1}s^{-1}$ of $g^{-1}$ have the same
invariant axis $\Gamma = s(\mathrm{Ax}(g))$, and this axis almost contains [x,y]. Changing f
to $f^{-1}$ if necessary, we can assume that $\pi_\Gamma(y)$ and $f(\pi_\Gamma(x))$ lie on the
same side$^1$ of $\pi_\Gamma(x)$ in $\Gamma$. This assumption made, the isometry f is called
a support of [x,y] (and so $f^{-1}$ is the support of the relator [y,x]). The
segment [x,y] is a relator of size p/q if furthermore
$d(x,y) \geq \frac{p}{q}L$. We say that [x,y] contains a relator of size p/q if
there is a segment $[x',y'] \subset [x,y]$ which is such a relator.

A pair of points (x,y) is neutral if none of the geodesic segments [x,y]
between x and y contains a relator of size 11/20; by a slight abuse of
notation, we also say that the segment [x,y] is neutral if this property holds
(even if there are a priori several segments from x to y).

$^1$ Since $d(x,y) \geq 20\theta$, this assumption is meaningful: It does not depend on
the choices of the projections of x and y on Ax(f).
2.4.2. Admissible presentations. Let h be an element of the normal subgroup
$\langle\langle g \rangle\rangle \subset G$. We can write h as a product $h = h_k h_{k-1} \cdots h_1$ where
each $h_i$ is a conjugate of g or its inverse:

$\forall\, 1 \leq i \leq k,\ \exists\, s_i \in G,\quad h_i = s_i g s_i^{-1} \text{ or } s_i g^{-1} s_i^{-1}$.

By convention, each of the $h_i$ comes with $s_i \in G$ and thus with an invariant
axis $\mathrm{Ax}(h_i) = s_i(\mathrm{Ax}(g))$.

We fix a base point $x_0 \in \mathcal{H}$, and associate three sequences of points
$(a_i)$, $(b_i)$, and $(x_i)$, $1 \leq i \leq k$, to the given base point $x_0$ and the
factorization of h into the product of the $h_i = s_i g^{\pm 1} s_i^{-1}$. The
definition requires two steps. First, for all $1 \leq i \leq k$,

• $x_i$ is equal to $h_i(x_{i-1})$; in particular $x_k = h(x_0)$;

• $a_i$ is the projection of $x_{i-1}$ on the geodesic $\mathrm{Ax}(h_i)$;

• $b_i$ is equal to $h_i(a_i)$; in particular, both $a_i$ and $b_i$ are on $\mathrm{Ax}(h_i)$.

This is the first step toward the definition of the three sequences of points.

Remark 2.11. If the translation length L is larger than $480\theta$ then, after a
slight modification of the $a_i$ and $b_i$, the following properties are
satisfied:

(1) Each $[a_i, b_i]$ is a subsegment of $\mathrm{Ax}(h_i)$ of length at least
    $\frac{19}{20}L$;

(2) $(x_{i-1}|b_i)_{a_i} \leq 2\theta$ and $(a_i|x_i)_{b_i} \leq 2\theta$;

(3) all segments $[x_{i-1}, a_i]$ and $[b_i, x_i]$ have length at least $10\theta$ (for
    $1 \leq i \leq k$).

To obtain property (3) when $[x_{i-1}, a_i]$ is too small, we translate $a_i$
towards $b_i$ on a small distance (at most $12\theta$), and similarly we translate
$b_i$ towards $a_i$. The first property remains true as long as
$\frac{L}{20} \geq 24\theta$. The second one follows from Lemma 2.3.

In a second step, we use this remark to move the points $a_i$ and $b_i$ along the
geodesic line $\mathrm{Ax}(h_i)$, and to define new sequences $(a_i)$, $(b_i)$, $(x_i)$ that
satisfy the three properties (1), (2), and (3). The definition of $(x_i)$ has
not been changed during this second step. The point $a_i$ is no longer equal to
a projection of $x_{i-1}$ on $\mathrm{Ax}(h_i)$, but is at distance at most $12\theta$ from
such a projection; the point $b_i$ is not equal to $h_i(a_i)$, but the distance
between these two points is less than $24\theta$.

We say that $h_k \cdots h_1$ is an admissible presentation of h (with respect to
the base point $x_0$) if one can construct the three sequences as above with the
additional property that all pairs $(x_{i-1}, a_i)$, $1 \leq i \leq k$, are neutral.



Lemma 2.12. Let $x_0$ be a base point in $\mathcal{H}$. Let g be an element of G, and h
be an element of the normal subgroup generated by g. Assume that
$28\theta \leq \frac{L}{20}$. Then h admits at least one admissible presentation.
Remark 2.13. Thanks to this lemma, the following property can be added to
the first three properties listed in Remark 2.11:

(4) All pairs $(x_{i-1}, a_i)$ and $(b_i, x_i)$ are neutral.

Proof of Lemma 2.12. Start with a decomposition $h = h_k \cdots h_1$ with
$h_i = s_i g^{\pm 1} s_i^{-1}$ and construct the sequences of points $(a_i)$, $(b_i)$,
and $(x_i)$ as above. Let I be the set of indices $1 \leq i \leq k$ such that
$(x_{i-1}, a_i)$ is not neutral. Suppose I is not empty (otherwise we are done).
Our goal is to modify the construction in order to get a new decomposition of
h which is admissible.

Changing the decomposition.- Pick $i \in I$. Then there are two geodesic
segments $[y,z] \subset [x_{i-1}, a_i]$ (with $y \in [x_{i-1}, z]$) and a conjugate
$f = sgs^{-1}$ of g such that [y,z] is $7\theta$-close to $\mathrm{Ax}(f) = s(\mathrm{Ax}(g))$ and
$d(y,z) \geq \frac{11}{20}L$; we fix such a segment [y,z] with a maximal length.
Let y', z' be projections of y and z on Ax(f); we have $d(y,y') \leq 7\theta$ and
$d(z,z') \leq 7\theta$. If y' is contained in the segment $[f(y'), z'] \subset \mathrm{Ax}(f)$,
we change f into $f^{-1}$. Replace $h_i$ by the product of three conjugates of g

$(h_i f^{-1} h_i^{-1})\, h_i\, f$.

This gives rise to a new decomposition of h as a product of conjugates of
$g^{\pm 1}$, hence to new sequences of points. Let $a'_i$ be a projection of
$x_{i-1}$ on Ax(f). Concerning the sequence $(x_i)$, we have two new points
$x'_i = f(x_{i-1})$ and $x'_{i+1} = h_i(x'_i)$; the point
$x'_{i+2} = h_i f^{-1} h_i^{-1}(x'_{i+1})$ is equal to the old point
$x_i = h_i(x_{i-1})$. Thus, the non neutral pair $(x_{i-1}, a_i)$ disappears and is
replaced by three new pairs $(x_{i-1}, a'_i)$, $(x'_i, a'_{i+1})$, and
$(x'_{i+1}, a'_{i+2})$ (see Figure 1). Note that if any of these segments is small
then it is automatically neutral; so without loss of generality we can assume
that there was no need to move $a'_i$, $a'_{i+1}$ or $a'_{i+2}$ as in Remark 2.11.

Two estimates.- Our first claim is

(1) $d(x_{i-1}, a'_i) = d(x'_{i+1}, a'_{i+2}) \leq d(x_{i-1}, a_i) - \frac{10}{20}L$.

To prove it, we write

$d(x_{i-1}, a_i) \geq d(x_{i-1}, y) + d(y,z) \geq d(x_{i-1}, y) + \frac{11}{20}L$,

and

$d(x_{i-1}, a'_i) \leq d(x_{i-1}, y') \leq d(x_{i-1}, y) + d(y, y') \leq d(x_{i-1}, y) + 7\theta$; (2.3)

claim (1) follows because $7\theta \leq \frac{L}{20}$.

Our second claim is

(2) $d(x'_i, a'_{i+1}) \leq d(x_{i-1}, a_i) - \frac{1}{20}L$.
Figure 1.

To prove it, denote by b the projection of $a_i$ on Ax(f). Similarly to (2.3) we
have

$d(b, a_i) \leq d(z, a_i) + 7\theta$. (2.4)

There are two cases, according to the position of $f(a'_i)$ with respect to the
segment $[a'_i, b] \subset \mathrm{Ax}(f)$. If $f(a'_i)$ is in this segment, then

$d(x'_i, a_i) \leq d(f(x_{i-1}), f(a'_i)) + d(f(a'_i), b) + d(b, a_i)$
$\qquad\qquad \leq d(x_{i-1}, a'_i) + d(a'_i, b) - L + d(b, a_i)$.

Applying Lemma 2.3 for the triangles $x_{i-1}, y', a'_i$ and $a_i, z', b$, and the
inequalities $d(y,y'), d(z,z') \leq 7\theta$, we get
$d(x_{i-1}, y) = d(x_{i-1}, a'_i) + d(y, a'_i)$ up to an error of $11\theta$, and
similarly $d(a_i, z) = d(a_i, b) + d(z, b)$ up to $11\theta$. Hence, we get

$d(x'_i, a_i) \leq d(x_{i-1}, y) + d(y,z) + d(z, a_i) - L + 22\theta$
$\qquad\qquad \leq d(x_{i-1}, a_i) - L + 22\theta$.

This concludes the proof of (2) in the first case, because
$-L + 22\theta \leq -\frac{L}{20}$.

The second case occurs when b is in the segment $[a'_i, f(a'_i)] \subset \mathrm{Ax}(f)$ (see
Figure 1). In this case we have

$d(x'_i, a_i) \leq d(f(x_{i-1}), f(a'_i)) + d(f(a'_i), b) + d(b, a_i)$
$\qquad\qquad \leq d(x_{i-1}, z) - \frac{11}{20}L + 7\theta + L - d(a'_i, b) + d(z, a_i) + 7\theta$,

and thus

$d(x'_i, a_i) \leq d(x_{i-1}, a_i) - d(a'_i, b) + \frac{9}{20}L + 14\theta$. (2.5)
On the other hand, the triangle inequality implies

$d(x_{i-1}, a'_i) + d(a'_i, b) + d(b, a_i) \geq d(x_{i-1}, a_i)$
$\qquad\qquad = d(x_{i-1}, y) + d(y,z) + d(z, a_i)$,

and so

$d(a'_i, b) \geq \frac{11}{20}L + d(x_{i-1}, y) - d(x_{i-1}, a'_i) + d(z, a_i) - d(b, a_i)$.

Using the inequalities (2.3) and (2.4) we obtain

$d(a'_i, b) \geq \frac{11}{20}L - 14\theta$.

Finally inequality (2.5) gives

$d(x'_i, a'_{i+1}) \leq d(x'_i, a_i) \leq d(x_{i-1}, a_i) - \frac{2}{20}L + 28\theta$,

hence claim (2) because $28\theta \leq \frac{L}{20}$.

Conclusion. Since the modification is local, and does not change the points $a_j$ for $j \ne i$, we can perform a similar replacement for all indices $i$ in $I$. We obtain a new presentation for $h$ and a new list of bad indices $I'$. Either this new list is empty, or by the two estimates (claims (1) and (2)) the maximum of the lengths $d(x_{j-1}, a_j)$ over all the non neutral pairs $(x_{j-1}, a_j)$, $j \in I'$, drops at least by $\tfrac{1}{20}L$. By induction, after a finite number of such replacements, we obtain an admissible presentation. □

The following lemma provides a useful property of admissible presentations with a minimum number of factors $h_i$.

Lemma 2.14. Let $h = h_k \cdots h_1$ be an admissible presentation with base point $x_0$. If there exist two indices $j > i$ such that $h_j = h_i^{-1}$, then $h$ admits an admissible presentation with base point $x_0$ and only $k - 2$ factors.

Proof. We may assume that $j \ge i + 2$, otherwise the simplification is obvious. The decomposition for $h$ is then

$h = h_k \cdots h_{j+1}\, h_i^{-1}\, h_{j-1} \cdots h_{i+1}\, h_i\, h_{i-1} \cdots h_1.$

Note that $i$ can be equal to $1$, and $j$ can be equal to $k$. We have a sequence of triplets $(a_i, b_i, x_i)$, $i = 1, \dots, k$, associated with this presentation and with the base point $x_0$. Then we claim that

$h = h_k \cdots h_{j+1}\, (h_i^{-1} h_{j-1} h_i)(h_i^{-1} h_{j-2} h_i) \cdots (h_i^{-1} h_{i+1} h_i)\, h_{i-1} \cdots h_1$

is another admissible presentation with base point $x_0$ and with $k - 2$ factors. Indeed the sequence of $k - 2$ triplets associated with this new presentation is

$(a_1, b_1, x_1), \dots, (a_{i-1}, b_{i-1}, x_{i-1}), (h_i^{-1}(a_{i+1}), h_i^{-1}(b_{i+1}), h_i^{-1}(x_{i+1})), \dots,$
$(h_i^{-1}(a_{j-1}), h_i^{-1}(b_{j-1}), h_i^{-1}(x_{j-1})), (a_{j+1}, b_{j+1}, x_{j+1}), \dots, (a_k, b_k, x_k)$

and one checks that all relevant segments are neutral because they come from neutral segments of the previous presentation. □
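
The regrouping used in this proof is a purely group-theoretic identity, and it can be checked on a toy example. The sketch below is only an illustration: it uses permutations as stand-ins for isometries, the helper names (`compose`, `inverse`, `total`, `shorten`) are hypothetical, and it verifies the equality of the two products, not the admissibility (neutrality) conditions.

```python
def compose(p, q):
    """Return p∘q (apply q first, then p) for permutations stored as tuples."""
    return tuple(p[q[x]] for x in range(len(q)))


def inverse(p):
    """Inverse permutation of p."""
    inv = [0] * len(p)
    for x, y in enumerate(p):
        inv[y] = x
    return tuple(inv)


def total(hs):
    """Product h_k ∘ ... ∘ h_1 for hs = [h_1, ..., h_k]."""
    result = tuple(range(len(hs[0])))
    for h in reversed(hs):  # multiply h_k first, then h_{k-1}, and so on
        result = compose(result, h)
    return result


def shorten(hs, i, j):
    """Drop h_i and h_j = h_i^{-1} (1 <= i < j <= k), conjugating the factors in between.

    Returns the k - 2 factors
    h_1, ..., h_{i-1}, h_i^{-1} h_{i+1} h_i, ..., h_i^{-1} h_{j-1} h_i, h_{j+1}, ..., h_k.
    """
    hi = hs[i - 1]
    hi_inv = inverse(hi)
    if hs[j - 1] != hi_inv:
        raise ValueError("h_j must be the inverse of h_i")
    middle = [compose(hi_inv, compose(h, hi)) for h in hs[i:j - 1]]
    return hs[:i - 1] + middle + hs[j:]
```

For instance, with six factors and $h_5 = h_2^{-1}$, `shorten(hs, 2, 5)` returns four factors whose product equals the original one.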

2.5. Proof of the normal subgroup theorem. Let $g$ be a tight element of $G$. Choose $B > 0$ a constant associated with the $14\theta$-rigidity of the axis of $g$. Choose $n \ge 1$ such that

$\tfrac{n}{20} L(g) \ge 60\theta + 2B.$

Let $L = nL(g)$ be the translation length of $g^n$. Note that $g^n \in G$ is tight, with a $(14\theta, B)$-rigid axis $\mathrm{Ax}(g^n) = \mathrm{Ax}(g)$. Let $h$ be a non trivial element of the normal subgroup $\langle\langle g^n \rangle\rangle$. Our goal is to prove $L(h) \ge L$, with equality if and only if $h$ is conjugate to $g^n$.
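
As a small arithmetic illustration of the choice of $n$, the following sketch computes the least exponent satisfying the condition, read here as $\tfrac{n}{20} L(g) \ge 60\theta + 2B$. The function name and the sample values are hypothetical; floating-point rounding is handled only by a final safety loop.

```python
import math


def minimal_exponent(translation_length, theta, rigidity_constant):
    """Least integer n >= 1 with (n / 20) * L(g) >= 60 * theta + 2 * B."""
    if translation_length <= 0:
        raise ValueError("L(g) must be positive for a hyperbolic isometry")
    threshold = 60 * theta + 2 * rigidity_constant
    n = max(1, math.ceil(20 * threshold / translation_length))
    # Guard against rounding in the division above.
    while n * translation_length / 20 < threshold:
        n += 1
    return n
```

With $L(g) = 1$, $\theta = 0.5$ and $B = 2$, the threshold is $34$ and the function returns $n = 680$, so that every relator of size $4/20$ has length at least $4(60\theta + 2B)$.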

Pick a base point $x_0$ such that $d(x_0, h(x_0)) \le L(h) + \theta$. Lemma 2.12 applied to $g^n$ implies the existence of an admissible presentation with respect to the base point $x_0$:

$h = h_m \circ \cdots \circ h_1, \quad h_i = s_i g^{\pm n} s_i^{-1}.$

We assume that $m$ is minimal among all such choices of base points and admissible presentations. Lemma 2.14 implies that $h_j$ is different from $h_i^{-1}$ for all $1 \le i < j \le m$.

Let $(a_i)$, $(b_i)$, $(x_i)$ be the three sequences of points defined in §2.4; they satisfy the properties (1) to (4) listed in Remarks 2.11 and 2.13. Since the constructions below are more natural with segments than with pairs of points, and $\mathcal{H}$ is not assumed to be uniquely geodesic, we choose geodesic segments between the points $x_i$, as well as geodesic segments $[a_i, b_i] \subset \mathrm{Ax}(h_i)$.

We now introduce the following definition in order to state the key Lemma 2.15 (compare with Lemma 2.4 in [13]). A sequence of points $(c_{-1}, c_0, \dots, c_k, c_{k+1})$ in $\mathcal{H}$, together with a chain of segments $[c_i, c_{i+1}]$, $-1 \le i \le k$, is a configuration of order $k \ge 1$ for the segment $[x_0, x_j]$ if

(i) $x_0 = c_{-1}$ and $x_j = c_{k+1}$;
(ii) For all $0 \le i \le k$, and all $a \in [c_i, c_{i+1}]$, we have $(c_{i-1}|a)_{c_i} \le 3\theta$. In particular $(c_{i-1}|c_{i+1})_{c_i} \le 3\theta$;
(iii) For all $0 \le i \le k + 1$ we have $d(c_{i-1}, c_i) \ge 10\theta$;
(iv) For all $0 \le i \le k$ the segment $[c_i, c_{i+1}]$ is either neutral or a relator, with the following rules:
(iv-a) There are never two consecutive neutral segments;
(iv-b) The last segment $[c_k, c_{k+1}] = [c_k, x_j]$ is neutral;
(iv-c) The second segment $[c_0, c_1]$ is a relator of size 18/20 if $[c_1, c_2]$ is neutral (this is always the case when $k = 1$), and of size 17/20 otherwise;
(iv-d) For the other relators $[c_{i-1}, c_i]$, with $i > 1$, the size is 5/20 when $[c_i, c_{i+1}]$ is neutral and 4/20 otherwise;
(v) For all $0 \le i \le k$, if $[c_i, c_{i+1}]$ is a relator, then there is an index $l$ with $1 \le l \le j$, such that $h_l$ is the support of the relator $[c_i, c_{i+1}]$.

Note that properties (iv) and (v) do not concern the initial segment $[x_0, c_0]$, and that the size $p/q$ of a relator $[c_i, c_{i+1}]$, $0 \le i \le k - 1$, is equal to 18/20, 17/20, 5/20, or 4/20; moreover, this size is computed with respect to $L = L(g^n)$, and thus the minimum length of a relator $[c_i, c_{i+1}]$ is bounded from below by $4(60\theta + 2B)$.

Lemma 2.15. For each $j = 1, \dots, m$, there exists $k \ge 1$ such that the segment $[x_0, x_j]$ admits a configuration of order $k$. Moreover if $j \ge 2$ and $k = 1$ then the
first segment [xq,cq] of the configuration has length at least ^L. 

Proof. The proof is by induction on $j$, and uses the four properties that are listed in Remarks 2.11 and 2.13; we refer to them as properties (1), (2), (3), and (4). Note that (2) and (3) enable us to apply Lemma 2.6; similarly, properties (ii) and (iii) for a configuration of order $k$ show that Lemma 2.6 can be applied to the sequence of points in such a configuration.

Initialization. When $j = 1$, we take $c_0 = a_1$, $c_1 = b_1$ and get a configuration of order 1. Indeed, by property (2), we have

$(x_0|b_1)_{a_1} \le 2\theta$ and $(a_1|x_1)_{b_1} \le 2\theta,$

and, by property (1), $[c_0, c_1]$ is a relator of size 19/20 (it is a subsegment of $\mathrm{Ax}(h_1)$). The segments $[x_0, c_0]$ and $[x_1, c_1]$ are neutral (property (4)) and of length at least $10\theta$ (property (3)).

Suppose now that $[x_0, x_j]$ admits a configuration of order $k$. We want to find a configuration of order $k'$ for $[x_0, x_{j+1}]$. As we shall see, the proof provides a configuration of order $k' = 1$ in one case, and of order $k' \le k + 2$ in the other case.

Six preliminary facts. Consider the approximation tree $(\Phi, T)$ of $(x_j, x_0, x_{j+1})$. We choose $p \in [x_j, x_0]$ such that $\Phi(p) \in T$ is the branch point of the tripod $T$ (with $p = x_j$ if $T$ is degenerate). Let $a$ (resp. $b$) be a projection of $a_{j+1}$ (resp. $b_{j+1}$) on the segment $[x_j, x_{j+1}]$. By assertion (3) in Lemma 2.6 we have $d(a, a_{j+1}) \le 5\theta$ and $d(b, b_{j+1}) \le 5\theta$. Thus, the subsegment $[a, b] \subset [x_j, x_{j+1}]$ is contained in $[x_j + 5\theta, x_{j+1} - 5\theta]$; moreover, by Lemma 2.5, $[a, b]$ is a relator with support $h_{j+1}$.

For all $i \le k + 1$, denote by $c'_i$ a projection of $c_i$ on the segment $[x_0, x_j]$. Note that, by Lemma 2.6, we have $d(c_i, c'_i) \le 5\theta$, and the points $c'_i$, $-1 \le i \le k + 1$, form a monotonic sequence of points from $x_0$ to $x_j$ along the geodesic segment $[x_0, x_j]$.

Let $S_i$ be the interval of $T$ defined by $S_i = [\Phi(a), \Phi(b)] \cap [\Phi(c'_i), \Phi(c'_{i+1})]$. The preimages of $S_i$ by $\Phi$ are two intervals $I \subset [a, b]$ and $I' \subset [c'_i, c'_{i+1}]$ such that

$\Phi(I) = \Phi(I') = S_i.$

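Fact 2.16. If $[c_i, c_{i+1}]$ is neutral then $\mathrm{diam}(S_i) \le \tfrac{12}{20}L$.

Since $\Phi$ is an isometry along the geodesic segments $[x_j, x_0]$ and $[x_j, x_{j+1}]$, we only have to prove $\mathrm{diam}(I') \le \tfrac{12}{20}L$. By Lemma 2.5 we know that $[c'_i, c'_{i+1}] \subset \mathrm{Tub}_{7\theta}([c_i, c_{i+1}])$. Thanks to the triangle inequality, we can choose $J \subset [c_i, c_{i+1}]$ such that $J \subset \mathrm{Tub}_{7\theta}(I')$ and $\mathrm{diam}(J) \ge \mathrm{diam}(I') - 14\theta$. The properties of the approximation tree imply $I' \subset \mathrm{Tub}_{\theta}(I)$, and $I \subset \mathrm{Tub}_{7\theta}(\mathrm{Ax}(h_{j+1}))$. Thus $J \subset \mathrm{Tub}_{15\theta}(\mathrm{Ax}(h_{j+1}))$. Applying Lemma 2.4 (with $\beta = 16\theta$) and shortening $J$ by $32\theta$ ($16\theta$ at each end), we obtain $\mathrm{diam}(J) \ge \mathrm{diam}(I') - 46\theta$ and $J \subset \mathrm{Tub}_{2\theta}(\mathrm{Ax}(h_{j+1}))$. Thus $J$ is a relator contained in the neutral segment $[c_i, c_{i+1}]$, hence

$\tfrac{11}{20}L \ge \mathrm{diam}(J) \ge \mathrm{diam}(I') - 46\theta$, and therefore $\tfrac{12}{20}L \ge \mathrm{diam}(I')$.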

Fact 2.17. If $[c_i, c_{i+1}]$ is a relator then $\mathrm{diam}(S_i) \le B$.

By property (v), there exists an index $l$, with $1 \le l \le j$, such that $[c_i, c_{i+1}] \subset \mathrm{Tub}_{7\theta}(\mathrm{Ax}(h_l))$. Since $I' \subset \mathrm{Tub}_{7\theta}([c_i, c_{i+1}])$, we get

$I' \subset \mathrm{Tub}_{14\theta}(\mathrm{Ax}(h_l)).$

On the other hand $I' \subset \mathrm{Tub}_{\theta}(I)$ and $I \subset \mathrm{Tub}_{7\theta}(\mathrm{Ax}(h_{j+1}))$, thus

$I' \subset \mathrm{Tub}_{8\theta}(\mathrm{Ax}(h_{j+1})).$

If $\mathrm{diam}(I') > B$, the rigidity assumption shows that $h_{j+1}$ and $h_l$ share the same axis, with opposite orientations; since $g$ is tight, we obtain $h_{j+1} = h_l^{-1}$. By Lemma 2.14 this contradicts the minimality of the presentation of $h$. This proves that $\mathrm{diam}(I') \le B$.

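Fact 2.18. Suppose $[c_i, c_{i+1}]$ is a relator. Then

$\mathrm{diam}([\Phi(x_j), \Phi(a)] \cap [\Phi(c'_i), \Phi(c'_{i+1})]) \le \tfrac{12}{20}L,$
$\mathrm{diam}([\Phi(b), \Phi(x_{j+1})] \cap [\Phi(c'_i), \Phi(c'_{i+1})]) \le \tfrac{12}{20}L.$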




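The two inequalities are similar; we prove only the first one. By property (v), there exists an index $l$, with $1 \le l \le j$, such that $h_l$ is the support of $[c_i, c_{i+1}]$. Let $K \subset [a, x_j]$ and $K' \subset [c'_i, c'_{i+1}]$ be two intervals such that

$\Phi(K) = \Phi(K') = [\Phi(x_j), \Phi(a)] \cap [\Phi(c'_i), \Phi(c'_{i+1})].$

We want to prove $\mathrm{diam}(K') \le \tfrac{12}{20}L$. Applying the triangle inequality, we can choose $J \subset [a_{j+1}, x_j]$ such that $J \subset \mathrm{Tub}_{7\theta}(K)$ and $\mathrm{diam}(J) \ge \mathrm{diam}(K) - 14\theta$. Now we have $K \subset \mathrm{Tub}_{\theta}(K')$, $K' \subset \mathrm{Tub}_{7\theta}([c_i, c_{i+1}])$ and $[c_i, c_{i+1}] \subset \mathrm{Tub}_{7\theta}(\mathrm{Ax}(h_l))$. Thus $J \subset \mathrm{Tub}_{22\theta}(\mathrm{Ax}(h_l))$. Applying Lemma 2.4 with $\beta = 23\theta$, we shorten $J$ by $46\theta$ ($23\theta$ on each end) and obtain

$J \subset \mathrm{Tub}_{2\theta}(\mathrm{Ax}(h_l))$ and $\mathrm{diam}(J) \ge \mathrm{diam}(K') - 60\theta.$

The admissibility condition implies that $J \subset [a_{j+1}, x_j]$ is neutral, and therefore

$\tfrac{11}{20}L \ge \mathrm{diam}(J) \ge \mathrm{diam}(K') - 60\theta$, so that $\tfrac{12}{20}L \ge \mathrm{diam}(K')$.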

Fact 2.19. The segment [<!>(&), <£(a)] is not contained in the segment [$(cq), 

Let us prove this by contradiction, assuming [$(&), $(a)] C [^Kcq), <3?(xj)]. 
By Property (iv-a) and Fact 2.16, [$(&), $(a)] intersects at least one segment 
[$(c-), $(c- +1 )] for which [a, Cj+i] is a relator; Fact 2.17 implies that [$(c-), $(c- +1 )] 
is not contained in [<£(«), <&(»)] (it must intersect the boundary points of [<&(«), $(&)])• 
From this follows that [$(a), $>(&)] intersects at most two relators and one neutral 
segment. Facts 2.16 and 2.17 now give the contradictory inequality 2B + ^qL > 

Fact 2.20. The segment [3>(cg), 3>(c^)] is not contained in the segment [$(xj+i), 

Since [aj+i,Xj] is neutral and [co,ci] is a relator of size > 17/20, the segment 
[$(cq),$(c' 1 )] is not contained in [<3?(), $(xj)]. Assume that [$(c' ), &(c[)] C 
[$(xj+i),$>(xj)], apply Fact 2.19 and then Fact 2.17 and Fact 2.18; this gives 
the contradictory inequality B + ^L > ^L — 100. 

These last two facts imply <3?(o) E [$(xj+i),®(p)] and <&(c' ) E [$(x ),$(p)]. 
Moreover, with this new property in mind, the proofs of Facts 2.19 and 2.20 give: 

Fact 2.21. The segment [$(&), $(p)] has length at least JjL, and the segment 
[<3>(cq), $(p)] has length at least ^L. 

In particular the tripod T is not degenerate at $(xq) nor at <&(xj +1 ). 
The induction. We now come back to the proof by induction. We distinguish two cases, with respect to the position of $\Phi(a)$ relative to the branch point $\Phi(p)$.



• First case. $(\Phi(b)|\Phi(x_0))_{\Phi(a)} \le \tfrac{1}{20}L - 12\theta$.

In other words we assume that either $\Phi(a) \in [\Phi(b), \Phi(p)]$, or $\Phi(a)$ is close to $\Phi(p)$. Note that this includes the situation where $\Phi(x_j)$ is a degenerate vertex of $T$. By Fact 2.21, the situation is similar to Figure 2. Since the distance between $a_{j+1}$ (resp. $b_{j+1}$) and $a$ (resp. $b$) is at most $5\theta$, the triangle inequality and Corollary 2.2 imply

$(b_{j+1}|x_0)_{a_{j+1}} \le (b|x_0)_a + (10\theta + 5\theta + 5\theta)/2$
$\le (\Phi(b)|\Phi(x_0))_{\Phi(a)} + \theta + 10\theta,$

and the assumption made in this first case gives

$(b_{j+1}|x_0)_{a_{j+1}} \le \tfrac{1}{20}L - \theta.$ (2.6)




Figure 2. The point $\Phi(a)$ could also be on $[\Phi(p), \Phi(x_j)]$, but not far from $\Phi(p)$.

We now consider another approximation tree $\Psi \colon \mathcal{H} \to T'$, for the list $(a_{j+1}, b_{j+1}, x_0)$, and choose a point $q \in [a_{j+1}, b_{j+1}]$ such that $\Psi(q)$ is the branch point of $T'$ (see Figure 3).



Figure 3.



We then define a new configuration of points $c_{-1} = x_0$, $c_0 = q$, $c_1 = b_{j+1}$, $c_2 = x_{j+1}$, and we show that this defines a configuration of order 1 for $[x_0, x_{j+1}]$.




Using inequality (2.6), the fact that $\Psi$ is an isometry along $[a_{j+1}, b_{j+1}]$, and Corollary 2.2, we obtain

$d(q, b_{j+1}) \ge \tfrac{19}{20}L - (b_{j+1}|x_0)_{a_{j+1}} - \theta \ge \tfrac{18}{20}L.$

Thus $[q, b_{j+1}]$ is a relator of size 18/20 with support $h_{j+1}$: this gives properties (iv-c) and (v).

By Lemma 2.3 we have $(q|x_{j+1})_{b_{j+1}} \le 2\theta$, and by Corollary 2.2 we have $(x_0|b_{j+1})_q \le \theta$, so we obtain property (ii); property (iii) follows from the definition of $b_{j+1}$ (property (3) in Remark 2.11). Thus, $(c_i)_{-1 \le i \le 2}$ is a configuration of order 1 for the segment $[x_0, x_{j+1}]$.

Moreover, we have

$d(x_0, q) = (\Psi(a_{j+1})|\Psi(b_{j+1}))_{\Psi(x_0)}$
$\ge (a_{j+1}|b_{j+1})_{x_0} - \theta$
$\ge (a|b)_{x_0} - 11\theta$
$\ge (\Phi(a)|\Phi(b))_{\Phi(x_0)} - 12\theta$
$\ge d(\Phi(x_0), \Phi(p)) - 12\theta.$



Fact 2.21 then gives 



d(x o ,q)>±L-120>±L 



so we obtain the second assertion in Lemma 2.15. 

• Second case. $(\Phi(b)|\Phi(x_0))_{\Phi(a)} > \tfrac{1}{20}L - 12\theta$.

Let $i$ be the index such that $\Phi(p) \in [\Phi(c'_i), \Phi(c'_{i+1})]$. This index is uniquely defined if we impose $\Phi(c'_i) \ne \Phi(p)$. The assumption implies $\Phi(a) \in [\Phi(p), \Phi(x_j)]$ and $d(\Phi(p), \Phi(a)) > \tfrac{1}{20}L - 12\theta$. By Fact 2.21 again, the situation is similar to Figure 4. We distinguish two subcases according to the nature of $[c_i, c_{i+1}]$.



Figure 4. On this picture, $\Phi(c'_{i+1})$ is in between $\Phi(p)$ and $\Phi(a)$; another possible case would be $\Phi(a) \in [\Phi(p), \Phi(c'_{i+1})]$.
Second case, first subcase. Assume $[c_i, c_{i+1}]$ is neutral. Then if $i < k$ the segment $[c_{i+1}, c_{i+2}]$ is a relator (if $i = k$ the following discussion is even easier). If $d(\Phi(p), \Phi(c'_{i+1})) \le 34\theta$ then Fact 2.17 implies $d(\Phi(p), \Phi(a)) \le B + 34\theta < \tfrac{1}{20}L - 12\theta$. This contradicts the assumption of the second case; as a consequence,

$\min\{d(\Phi(p), \Phi(c'_{i+1})),\ d(\Phi(p), \Phi(a))\} > 34\theta.$ (2.7)

Consider now the approximation tree $\Psi \colon \mathcal{H} \to T''$ of $(c_{i+1}, c_i, a_{j+1}, b_{j+1})$. We have

$(\Psi(c_{i+1})|\Psi(a_{j+1}))_{\Psi(b_{j+1})} \ge (c'_{i+1}|a)_b - 15\theta - \theta$
$\ge (\Phi(c'_{i+1})|\Phi(a))_{\Phi(b)} - 17\theta$
$\ge d(\Phi(b), \Phi(p)) - 17\theta + \min\{d(\Phi(p), \Phi(c'_{i+1})),\ d(\Phi(p), \Phi(a))\}$

and

$(\Psi(c_i)|\Psi(a_{j+1}))_{\Psi(b_{j+1})} \le (c'_i|a)_b + 15\theta + \theta$
$\le (\Phi(c'_i)|\Phi(a))_{\Phi(b)} + 17\theta$
$\le d(\Phi(b), \Phi(p)) + 17\theta.$

By (2.7) we get

$(\Psi(c_{i+1})|\Psi(a_{j+1}))_{\Psi(b_{j+1})} > (\Psi(c_i)|\Psi(a_{j+1}))_{\Psi(b_{j+1})}.$

Thus we obtain the pattern depicted on Figure 5, where $q$ is a point of $[c_i, c_{i+1}]$ which is mapped to the branch point of $T''$. Note that $[c_i, q] \subset [c_i, c_{i+1}]$ is again a neutral segment.



Figure 5.

Since the point $q$ is $4\theta$-close to $\mathrm{Ax}(h_{j+1})$, the segment $[q, b_{j+1}]$ is a relator with support $h_{j+1}$. We have

$d(b_{j+1}, q) \ge d(\Psi(b_{j+1}), \Psi(q))$
$\ge (c_i|a_{j+1})_{b_{j+1}} - \theta$
$\ge (c'_i|a)_b - 15\theta - \theta$
$\ge d(\Phi(b), \Phi(p)) - 17\theta.$

Thus by Fact 2.21 we see that $[q, b_{j+1}]$ is a relator of size 5/20, and the same conclusion holds even if one moves $q$ along the segment $[q, b_{j+1}]$ by at most $12\theta$. The inequalities $(x_{j+1}|a_{j+1})_{b_{j+1}} \le 2\theta$ and $(a_{j+1}|b_{j+1})_q \le \theta$ imply that $(x_{j+1}|q)_{b_{j+1}} \le 3\theta$.

In this first subcase, we define the new configuration $(\hat c_l)$ by $\hat c_l = c_l$ for $l$ between $-1$ and $i$, and by $\hat c_{i+1} = q$, $\hat c_{i+2} = b_{j+1}$ and $\hat c_{i+3} = x_{j+1}$; this defines a configuration of order $i + 2$ for the segment $[x_0, x_{j+1}]$.

By construction, properties (i), (ii), and (v) are satisfied, and property (iii) is obtained after translating $q$ along $[q, b_{j+1}]$ on a distance less than $12\theta$. This does not change the fact that $[q, b_{j+1}]$ is a relator of size 5/20. Since the new configuration is obtained from the previous one by cutting it after $q$ and then adding a relator $[q, b_{j+1}]$ and a neutral segment $[b_{j+1}, x_{j+1}]$, property (iv) is also satisfied.

Second case, second subcase. Assume $[c_i, c_{i+1}]$ is a relator. By Fact 2.17 the segment $[\Phi(p), \Phi(c'_{i+1})]$ has length at most $B$, and $[c_{i+1}, c_{i+2}]$ must be neutral (otherwise again $[\Phi(p), \Phi(a)]$ would be too small). Thus the relator $[c_i, c_{i+1}]$ has size 5/20 (or 18/20 if $i = 0$). We consider the approximation tree $\Psi \colon \mathcal{H} \to T''$ of $(c_{i+1}, c_i, c_{i+2}, a_{j+1}, b_{j+1})$. We obtain one of the situations depicted on Fig. 6, where now $q \in [c_{i+1}, c_i]$ or $[c_{i+1}, c_{i+2}]$.

In case (a), $[c_{i+1}, q]$ is small, and therefore neutral. We are in a case similar to the first subcase and to Figure 5. We still have

$d(b_{j+1}, q) \ge d(\Phi(b), \Phi(p)) - 17\theta,$

and one can define a new sequence $(\hat c_l)$ by $\hat c_l = c_l$ for $l$ between $-1$ and $i + 1$, and by $\hat c_{i+2} = q$, $\hat c_{i+3} = b_{j+1}$ and $\hat c_{i+4} = x_{j+1}$. This new sequence cuts $(c_l)$ after $l = i + 1$, adds a neutral segment after the last relator $[c_i, c_{i+1}]$, and then a relator $[q, b_{j+1}]$ and a final neutral segment $[b_{j+1}, x_{j+1}]$. Hence, $(\hat c_l)$ is a configuration of order $i + 3$ for the segment $[x_0, x_{j+1}]$.



Figure 6. Cases (a) and (b).
In case (b), let us check that $d(c_{i+1}, q) \le B + 15\theta$, so that $[c_i, q]$ is a relator of size 4/20 (or 17/20 if $i = 0$). We have

$d(c_{i+1}, q) \le (c_i|b_{j+1})_{c_{i+1}} + 2\theta$
$\le (\Phi(c'_i)|\Phi(b))_{\Phi(c'_{i+1})} + 15\theta$
$\le B + 15\theta.$

As in the first subcase, the estimate for $d(b_{j+1}, q)$ gives

$d(b_{j+1}, q) \ge d(\Phi(b), \Phi(p)) - 17\theta,$

and Fact 2.21 implies that $[q, b_{j+1}]$ is a relator of size 5/20; moreover $(x_{j+1}|q)_{b_{j+1}} \le 3\theta$.

We define the new configuration $(\hat c_l)$ by taking $\hat c_l = c_l$ for $l$ between $-1$ and $i$, and $\hat c_{i+1} = q$, $\hat c_{i+2} = b_{j+1}$ and $\hat c_{i+3} = x_{j+1}$; this defines a configuration of order $i + 2$ for the segment $[x_0, x_{j+1}]$.

By construction, properties (i), (ii), (iii), (iv) and (v) are satisfied. □

Proof of Theorem 2.9. By Lemma 2.15 there exists a configuration $(c_i)$ of order $k$ for $[x_0, x_m]$, where $x_m = h(x_0)$. Recall that by our choice of $x_0$ we have $L(h) \ge d(x_0, x_m) - \theta$.

If $k \ge 2$, then we have at least two distinct relators: the first one, $[c_0, c_1]$, has size at least 17/20, and the last one, $[c_{k-1}, c_k]$, has size 5/20. By Lemma 2.6 we obtain $L(h) \ge \tfrac{22}{20}L - 11\theta > L$.

If k = 1, either m = 1 and h is conjugate to g n , or we have d(xo,co) > ^L. 
On the other hand d(co, ci) > ^|L, so we obtain L(/t) > |^L — 116* > L. □ 

3. Hyperbolic spaces with constant negative curvature 

This section is devoted to the classical hyperbolic space $\mathbb{H}^n$, where the dimension $n$ is allowed to be infinite. As we shall see, constant negative curvature, which is stronger than $\delta$-hyperbolicity, is a useful property to decide whether the axis $\mathrm{Ax}(g)$ of an isometry is rigid.

3.1. Hyperbolic spaces. 

3.1.1. Definition. Let $H$ be a real Hilbert space, with scalar product $(x \cdot y)_H$, and let $\|x\|_H$ denote the norm of any element $x \in H$. Let $u$ be a unit vector in $H$, and $u^{\perp}$ be its orthogonal complement; each element $x \in H$ decomposes uniquely into $x = a(x)u + v_x$ with $a(x)$ in $\mathbb{R}$ and $v_x$ in $u^{\perp}$. Let $\langle \cdot | \cdot \rangle \colon H \times H \to \mathbb{R}$ be the symmetric bilinear mapping defined by

$\langle x|y \rangle = a(x)a(y) - (v_x \cdot v_y)_H.$

This bilinear mapping is continuous and has signature equal to $(1, \dim(H) - 1)$. The set of points $x$ with $\langle x|x \rangle = 0$ is the light cone of $\langle \cdot | \cdot \rangle$. Let $\mathbb{H}$ be the subset of $H$ defined by

$\mathbb{H} = \{x \in H \ ;\ \langle x|x \rangle = 1 \text{ and } \langle u|x \rangle > 0\}.$

The space $\mathbb{H}$ is the sheet of the hyperboloid $\langle x|x \rangle = 1$ which contains $u$. The function $\mathrm{dist} \colon \mathbb{H} \times \mathbb{H} \to \mathbb{R}_+$ defined by

$\cosh(\mathrm{dist}(x, y)) = \langle x|y \rangle$

gives a distance on $\mathbb{H}$, and $(\mathbb{H}, \mathrm{dist})$ is a complete and simply connected riemannian manifold of dimension $\dim(H) - 1$ with constant sectional curvature $-1$ (this characterizes $\mathbb{H}$ if the dimension is finite). As such, $\mathbb{H}$ is a CAT($-1$) space, and therefore a $\delta$-hyperbolic space. More precisely, $\mathbb{H}$ is $\delta$-hyperbolic (in any dimension, even infinite) with $\delta = \log(3)$ (see [9], § 1.4 page 11). In particular, all properties listed in §2.1 (or even stronger ones) are satisfied.
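
The definition can be tested numerically in finite dimension. The following sketch assumes $H = \mathbb{R}^{n+1}$ with $u = (1, 0, \dots, 0)$, so that the first coordinate of a vector is $a(x)$ and the remaining ones form $v_x$; the function names are hypothetical.

```python
import math


def minkowski(x, y):
    """Bilinear form <x|y> = a(x) a(y) - (v_x . v_y), with u = (1, 0, ..., 0)."""
    return x[0] * y[0] - sum(s * t for s, t in zip(x[1:], y[1:]))


def lift(v):
    """Point of the hyperboloid sheet above v in u-perp: sqrt(1 + |v|^2) u + v."""
    return (math.sqrt(1.0 + sum(s * s for s in v)),) + tuple(v)


def dist(x, y):
    """Hyperbolic distance defined by cosh(dist(x, y)) = <x|y>."""
    # Rounding can push <x|y> slightly below 1 when x and y are very close.
    return math.acosh(max(1.0, minkowski(x, y)))
```

For example, `lift((0.0, 0.0))` is the base point $u$, and the point `lift((math.sinh(1.0), 0.0))` lies at distance $1$ from it, since $\cosh(1)$ is its first coordinate.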

3.1.2. Geodesics and boundary. The hyperbolic space $\mathbb{H}$ is in one to one correspondence with its projection into the projective space $\mathbb{P}(H)$. The boundary of this subset of $\mathbb{P}(H)$ is the projection of the light cone of $\langle \cdot | \cdot \rangle$; we shall denote it by $\partial\mathbb{H}$ and call it the boundary of $\mathbb{H}$ (or boundary at infinity).

Let $\Gamma$ be a geodesic line in $\mathbb{H}$. Then there is a unique plane $V_\Gamma \subset H$ such that $\Gamma = V_\Gamma \cap \mathbb{H}$. The plane $V_\Gamma$ intersects the light cone on two lines, and these lines determine two points of $\partial\mathbb{H}$, called the endpoints of $\Gamma$. If $x$ and $y$ are two distinct points of $\mathbb{H}$, there is a unique geodesic segment $[x, y]$ from $x$ to $y$; this segment is contained in a unique geodesic line, namely $\mathrm{Vect}(x, y) \cap \mathbb{H}$.

Let $x$ be a point of $\mathbb{H}$ and $\Gamma$ be a geodesic. The projection $\pi_\Gamma(x) \in \Gamma$, i.e. the point $y \in \Gamma$ which minimizes $\mathrm{dist}(x, y)$, is unique.

3.1.3. Distances between geodesics. The following lemma shows that close geodesic segments are indeed exponentially close in a neighborhood of their middle points; this property is not satisfied in general $\delta$-hyperbolic spaces because $\delta$-hyperbolicity does not tell anything at scale $\le \delta$.

Lemma 3.1. Let $x$, $x'$, $y$, and $y'$ be four points of the hyperbolic space $\mathbb{H}$. Assume that

(i) $\mathrm{dist}(x, x') \le \varepsilon$, and $\mathrm{dist}(y, y') \le \varepsilon$,
(ii) $\mathrm{dist}(x, y) \ge B$,
(iii) $B \ge 10\varepsilon$,

where $\varepsilon$ and $B$ are positive real numbers. Let $[u, v] \subset [x, y]$ be the geodesic segment of length $\tfrac{3}{4}\mathrm{dist}(x, y)$ centered in the middle of $[x, y]$. Then $[u, v]$ is contained in the tubular neighborhood $\mathrm{Tub}_\nu([x', y'])$ with

$\cosh(\nu) - 1 \le 5\, \dfrac{\cosh(2\varepsilon) - 1}{\exp(B/2 - 4\varepsilon)}.$

The constants in this inequality are not optimal and depend on the choice of the ratio $\mathrm{dist}(u, v)/\mathrm{dist}(x, y) = 3/4$. A proof of this lemma can be found in [7].

3.1.4. Isometries. Let $f$ be an isometry of $\mathbb{H}$; then $f$ is the restriction of a unique continuous linear transformation of the Hilbert space $H$. In particular, $f$ extends to a homeomorphism of the boundary $\partial\mathbb{H}$. The three types of isometries (see § 2.3.1) have the following properties.

(1) If $f$ is elliptic there is a point $x$ in $\mathbb{H}$ with $f(x) = x$, and $f$ acts as a rotation centered at $x$. Fixed points of $f$ are eigenvectors of the linear extension $f \colon H \to H$ corresponding to the eigenvalue $1$.

(2) If $f$ is parabolic there is a unique fixed point of $f$ on the boundary $\partial\mathbb{H}$; this point corresponds to a line of eigenvectors with eigenvalue $1$ for the linear extension $f \colon H \to H$. The orbit $(f^k(x))$ of every point $x$ in $\mathbb{H}$ converges toward this fixed point in $\mathbb{P}(H)$ when $k$ goes to $+\infty$ and to $-\infty$.

(3) If $f$ is hyperbolic, then $f$ has exactly two fixed points $\alpha(f)$ and $\omega(f)$ on the boundary $\partial\mathbb{H}$ and the orbit $f^k(x)$ of every point $x \in \mathbb{H}$ goes to $\alpha(f)$ as $k$ goes to $-\infty$ and to $\omega(f)$ as $k$ goes to $+\infty$. The set $\mathrm{Min}(f)$ coincides with the geodesic line from $\alpha(f)$ to $\omega(f)$. In particular, $\mathrm{Min}(f)$ coincides with the unique $f$-invariant axis $\mathrm{Ax}(f) = \mathrm{Vect}(\alpha(f), \omega(f)) \cap \mathbb{H}$. The points $\alpha(f)$ and $\omega(f)$ correspond to eigenlines of the linear extension $f \colon H \to H$ with eigenvalues $\lambda^{-1}$ and $\lambda$, where $\lambda > 1$; the translation length $L(f)$ is equal to the logarithm of $\lambda$ (see Remark 4.5 below).
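
Property (3) can be illustrated in the smallest possible case. The sketch below assumes $H = \mathbb{R}^2$ with the form $\langle x|y \rangle = x_0 y_0 - x_1 y_1$, so that $\mathbb{H}$ is a single geodesic line, and takes for $f$ a Lorentz boost of parameter $t > 0$; the function names are hypothetical. The isotropic vectors $(1, -1)$ and $(1, 1)$ play the roles of $\alpha(f)$ and $\omega(f)$.

```python
import math


def boost(t):
    """Matrix of the boost of parameter t in the coordinates (a(x), v_x)."""
    return ((math.cosh(t), math.sinh(t)),
            (math.sinh(t), math.cosh(t)))


def apply(matrix, x):
    """Image of the vector x under a 2 x 2 matrix."""
    return (matrix[0][0] * x[0] + matrix[0][1] * x[1],
            matrix[1][0] * x[0] + matrix[1][1] * x[1])


def form(x, y):
    """Bilinear form <x|y> = x0 * y0 - x1 * y1 of signature (1, 1)."""
    return x[0] * y[0] - x[1] * y[1]


def displacement(matrix, s):
    """dist(x, f(x)) for the point x = (cosh s, sinh s) of the hyperboloid."""
    x = (math.cosh(s), math.sinh(s))
    return math.acosh(max(1.0, form(x, apply(matrix, x))))
```

Here `apply(boost(t), (1.0, 1.0))` equals $\lambda \cdot (1, 1)$ with $\lambda = e^t$, while `displacement(boost(t), s)` returns $t = \log \lambda$ for every $s$, in agreement with $L(f) = \log \lambda$.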

3.2. Rigidity of axis for hyperbolic spaces. Let $G$ be a group of isometries of the hyperbolic space $\mathbb{H}$, and $g$ be a hyperbolic element of $G$. The following strong version of Lemma 2.8 is a direct consequence of Lemma 3.1.

Lemma 3.2. Let $\varepsilon' < \varepsilon$ be positive real numbers. If the segment $A \subset \mathbb{H}$ is $(\varepsilon', B')$-rigid, then $A$ is also $(\varepsilon, B)$-rigid with

$B = \max\left\{ 22\varepsilon,\ 18\varepsilon + 2\log\left( 5\, \dfrac{\cosh(4\varepsilon) - 1}{\cosh(\varepsilon') - 1} \right),\ \dfrac{4B'}{3} + 2\varepsilon \right\}.$

In the context of the hyperbolic spaces $\mathbb{H}$, Lemma 3.2 enables us to drop the $\varepsilon$ in the notation for $\varepsilon$-rigidity; we simply say that $A \subset \mathbb{H}$ is rigid.

Proof. By assumption, $B$ satisfies the following three properties

• $B - 2\varepsilon \ge 10(2\varepsilon) = 20\varepsilon$;
• $\cosh(\varepsilon') - 1 \ge 5(\cosh(4\varepsilon) - 1)/\exp(B/2 - 9\varepsilon)$;
• $(3/4)(B - 2\varepsilon) \ge B'$.

Let $f$ be an element of the group $G$ such that $A \cap_\varepsilon f(A)$ contains two points at distance $B$. Then $A$ and $f(A)$ contain two segments $I$ and $J$ of length $B - 2\varepsilon$ which are $2\varepsilon$-close. Lemma 3.1 and the inequalities satisfied by $(B - 2\varepsilon)$ show that $I$ and $J$ contain subsegments of length $3(B - 2\varepsilon)/4$ which are $\varepsilon'$-close. Since $A$ is $(\varepsilon', B')$-rigid, $f(A)$ coincides with $A$, and therefore $A$ is $(\varepsilon, B)$-rigid. □
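
The constants in Lemmas 3.1 and 3.2 can be explored numerically. The sketch below rests on two readings that should be treated as assumptions: the bound of Lemma 3.1 is taken to be $\cosh(\nu) - 1 \le c\,(\cosh(2\varepsilon) - 1)/\exp(B/2 - 4\varepsilon)$ with $c = 5$, and the value of $B$ in Lemma 3.2 is taken to be the smallest number satisfying the three properties listed in the proof. The function names are hypothetical.

```python
import math


def tube_radius(eps, length, c=5.0):
    """Radius nu with cosh(nu) - 1 = c (cosh(2 eps) - 1) / exp(length / 2 - 4 eps)."""
    if length < 10 * eps:
        raise ValueError("Lemma 3.1 requires B >= 10 * eps")
    bound = c * (math.cosh(2 * eps) - 1.0) / math.exp(length / 2 - 4 * eps)
    return math.acosh(1.0 + bound)


def rigidity_length(eps, eps_prime, b_prime, c=5.0):
    """Smallest B satisfying the three properties in the proof of Lemma 3.2."""
    if not 0 < eps_prime < eps:
        raise ValueError("expected 0 < eps_prime < eps")
    ratio = c * (math.cosh(4 * eps) - 1.0) / (math.cosh(eps_prime) - 1.0)
    return max(22 * eps,
               18 * eps + 2 * math.log(ratio),
               4 * b_prime / 3 + 2 * eps)
```

The second function is designed so that applying `tube_radius` to the pair $(2\varepsilon, B - 2\varepsilon)$ gives a radius at most $\varepsilon'$, which is the step of the proof where Lemma 3.1 is invoked. The radius decays exponentially in the length, which is the quantitative content of "exponentially close".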

Proposition 3.3. Let $G$ be a group of isometries of $\mathbb{H}$, and $g \in G$ be a hyperbolic isometry. Let $n$ be a positive integer, $p \in \mathrm{Ax}(g)$, and $\eta > 0$. If $\mathrm{Ax}(g)$ is not rigid, there exists an element $f$ of $G$ such that $f(\mathrm{Ax}(g)) \ne \mathrm{Ax}(g)$ and $\mathrm{dist}(f(x), x) \le \eta$ for all $x \in [g^{-n}(p), g^n(p)]$.

Proof. Since $\mathrm{Ax}(g)$ is not rigid, in particular $\mathrm{Ax}(g)$ is not $(\varepsilon, B)$-rigid for $B = (3n + 4)L(g)$ and $\varepsilon = \eta/2$. Then there exists an isometry $h \in G$ such that $h(\mathrm{Ax}(g))$ is different from $\mathrm{Ax}(g)$, but $\mathrm{Ax}(g)$ contains a segment $J$ of length $B$ which is mapped by $h$ into the tubular neighborhood $\mathrm{Tub}_\varepsilon(\mathrm{Ax}(g))$.

Changing $h$ into $g^j \circ h$, we can assume that the point $p$ is near the middle of the segment $h(J)$; and changing $h$ into $h \circ g^k$ moves $J$ to $g^{-k}(J)$. We can thus change $h$ into $h_1 = g^j \circ h \circ g^k$ for some $j, k \in \mathbb{Z}$, $B$ into $B_1 \ge B - 4L(g) \ge 3nL(g)$, and find two points $x$ and $y$ on $\mathrm{Ax}(g)$ that satisfy

(1) $\mathrm{dist}(x, y) \ge B_1$, $p \in [x, y]$, and $\mathrm{dist}(p, x) \ge B_1/3$, $\mathrm{dist}(p, y) \ge B_1/3$;
(2) either (a) $h_1(x) \in \mathrm{Tub}_\varepsilon([g^{-1}(x), x])$ and $h_1(y) \in \mathrm{Tub}_\varepsilon([g^{-1}(y), y])$, or (b) $h_1(y) \in \mathrm{Tub}_\varepsilon([g^{-1}(x), x])$ and $h_1(x) \in \mathrm{Tub}_\varepsilon([g^{-1}(y), y])$;
(3) $g^i(x)$ and $g^i(y)$, with $-2 \le i \le 2$, are at distance at most $\varepsilon$ from $h(\mathrm{Ax}(g))$.

This does not change the axis $h(\mathrm{Ax}(g))$ and the value of $\varepsilon$.

We now change $h_1$ into the commutator $h_2 = h_1^{-1} g^{-1} h_1 g$. We still have $h_2(\mathrm{Ax}(g)) \ne \mathrm{Ax}(g)$, because otherwise $h_1^{-1} g^{-1} h_1$ fixes $\mathrm{Ax}(g)$ and by uniqueness of the axis of a hyperbolic isometry of $\mathbb{H}$ we would have $h_1^{-1}(\mathrm{Ax}(g)) = \mathrm{Ax}(g)$. Moreover, property (2) above is replaced by $\mathrm{dist}(x, h_2(x)) \le 2\varepsilon \le \eta$ and $\mathrm{dist}(y, h_2(y)) \le \eta$ in case (a), and by $\mathrm{dist}(g^2(x), h_2(x)) \le \eta$ and $\mathrm{dist}(g^2(y), h_2(y)) \le \eta$ in case (b); similar properties are then satisfied by the points $g^i(x)$ and $g^i(y)$, $-2 \le i \le 2$, in place of $x$ and $y$.

Changing, once again, $h_2$ into $h_3 = h_2 \circ g^{-2}$ if necessary, we can assume that $\mathrm{dist}(x, h_3(x)) \le \eta$ and $\mathrm{dist}(y, h_3(y)) \le \eta$.

Consider the arc length parametrization

$m \colon t \in [-\infty, +\infty] \mapsto \mathrm{Ax}(g)$

such that $m(0) = p$ and $g(m(t)) = m(t + L(g))$. Since $p$ is in the interval $[x + B_1/3, y - B_1/3]$, we obtain $\mathrm{dist}(m(t), h_3(m(t))) \le \eta$ for all $t$ in $[-B_1/3, B_1/3]$. Defining $f = h_3$ we get

$\mathrm{dist}(z, f(z)) \le \eta, \quad \forall z \in [g^{-n}(p), g^n(p)]$

because $nL(g) \le B_1/3$. □

In particular, the proof for n = 1 gives the following corollary. 

Corollary 3.4. Let $G$ be a group of isometries of $\mathbb{H}$. Let $g$ be a hyperbolic element of $G$ and $p$ be a point of $\mathrm{Ax}(g)$. Let $\eta$ be a positive real number. If there is no $f$ in $G \setminus \{\mathrm{Id}\}$ such that $d(f(x), x) \le \eta$ for all $x \in [g^{-1}(p), g(p)]$, then $\mathrm{Ax}(g)$ is $(\eta/2, 7L(g))$-rigid.

Part B. Algebraic Geometry and the Cremona Group 

4. The Picard-Manin space 

4.1. Néron-Severi groups and rational morphisms. Let $X$ be a smooth projective surface defined over an algebraically closed field $k$. The Néron-Severi group $N^1(X)$ is the group of Cartier divisors modulo numerical equivalence. When the field of definition is the field of complex numbers, $N^1(X)$ coincides with the space of Chern classes of holomorphic line bundles of $X$ (see [34]), and thus

$N^1(X) = H^2(X(\mathbb{C}), \mathbb{Z})_{t.f.} \cap H^{1,1}(X, \mathbb{R})$

where $H^2(X(\mathbb{C}), \mathbb{Z})_{t.f.}$ is the torsion free part of $H^2(X(\mathbb{C}), \mathbb{Z})$ (the torsion part being killed when one takes the image of $H^2(X(\mathbb{C}), \mathbb{Z})$ into $H^2(X(\mathbb{C}), \mathbb{R})$). The rank $\rho(X)$ of this abelian group is called the Picard number of $X$. If $D$ is a divisor, we denote by $[D]$ its class in $N^1(X)$. The intersection form defines an integral quadratic form

$([D_1], [D_2]) \mapsto [D_1] \cdot [D_2]$

on $N^1(X)$, the signature of which is equal to $(1, \rho(X) - 1)$ by the Hodge index theorem (see [29], §V.1). We also write $N^1(X)_{\mathbb{R}} = N^1(X) \otimes \mathbb{R}$.

If $\pi \colon X \to Y$ is a birational morphism, then the pullback morphism

$\pi^* \colon N^1(Y) \to N^1(X)$

is injective and preserves the intersection form. For example, if $\pi$ is just the blow-up of a point with exceptional divisor $E \subset X$, then $N^1(X)$ is isomorphic to $\pi^*(N^1(Y)) \oplus \mathbb{Z}[E]$, this sum is orthogonal with respect to the intersection form, and $[E] \cdot [E] = -1$.

Example 4.1. The Néron-Severi group of the plane is isomorphic to $\mathbb{Z}[H]$ where $[H]$ is the class of a line $H$ and $[H] \cdot [H] = 1$. After $n$ blow-ups of points, the Néron-Severi group is isomorphic to $\mathbb{Z}^{n+1}$ with a basis of orthogonal vectors $[H]$, $[E_1]$, $\dots$, $[E_n]$ such that $[H]^2 = 1$ and $[E_i]^2 = -1$ for all $1 \le i \le n$.
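
To make Example 4.1 concrete, here is a minimal sketch of the intersection form in the basis $[H], [E_1], \dots, [E_n]$. The encoding of a class as a tuple of integer coefficients and the function name are choices made for this illustration only.

```python
def intersection(c1, c2):
    """Intersection number of two classes given by their coefficients on
    the orthogonal basis [H], [E_1], ..., [E_n], where [H]^2 = 1 and [E_i]^2 = -1."""
    if len(c1) != len(c2):
        raise ValueError("both classes must live on the same blow-up")
    return c1[0] * c2[0] - sum(a * b for a, b in zip(c1[1:], c2[1:]))
```

For instance, the class $[H] - [E_1]$ of the strict transform of a line through the first blown-up point is `(1, -1, 0)` when $n = 2$ and has self-intersection $0$, while the class $2[H] - [E_1] - \dots - [E_5]$ of the strict transform of a smooth conic through five blown-up points has self-intersection $4 - 5 = -1$.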

4.2. Indeterminacies. Let $f \colon X \dashrightarrow Y$ be a rational map between smooth projective surfaces. The indeterminacy set $\mathrm{Ind}(f)$ is finite, and the curves which are blown down by $f$ form a codimension 1 analytic subset of $X$, called the exceptional set $\mathrm{Exc}(f)$.

Let $H$ be an ample line bundle on $Y$. Consider the pullback by $f$ of the linear system of divisors which are linearly equivalent to $H$; when $X = Y = \mathbb{P}^2$ and $H$ is a line, this linear system is called the homaloidal net of $f$. The base locus of this linear system is supported on $\mathrm{Ind}(f)$, but it can of course include infinitely near points. We shall call it the base locus of $f$. To resolve the indeterminacies of $f$, one blows up the base locus (see [34] for base loci, base ideal, and their blow ups); in other words, one blows up $\mathrm{Ind}(f)$, obtaining $\pi \colon X' \to X$, then one blows up $\mathrm{Ind}(f \circ \pi)$, and so on; the process stops in a finite number of steps (see [29], §V.5).

Remark 4.2. If $f$ is a birational transformation of a projective surface with Picard number one, then $\mathrm{Ind}(f)$ is contained in $\mathrm{Exc}(f)$.²

4.3. Dynamical degrees. The rational map $f \colon X \dashrightarrow Y$ determines a linear map $f^* \colon N^1(Y) \to N^1(X)$. For complex surfaces, $f$ determines a linear map $f^* \colon H^2(Y, \mathbb{Z}) \to H^2(X, \mathbb{Z})$ which preserves the Hodge decomposition: the action of $f^*$ on $N^1(X)$ coincides with the action by pull-back on $H^2(X(\mathbb{C}), \mathbb{Z})_{t.f.} \cap H^{1,1}(X, \mathbb{R})$ (see [16] for example).

Assume now that $f$ is a birational selfmap of $X$. The dynamical degree $\lambda(f)$ of $f$ is the spectral radius of the sequence of linear maps $((f^n)^*)_{n \ge 0}$:

$\lambda(f) = \lim_{n \to +\infty} \left( \|(f^n)^*\|^{1/n} \right)$

where $\| \cdot \|$ is an operator norm on $\mathrm{End}(N^1(X)_{\mathbb{R}})$; the limit exists because the sequence $\|(f^n)^*\|$ is submultiplicative (see [16]). The number $\lambda(f)$ is invariant under conjugacy: $\lambda(f) = \lambda(g f g^{-1})$ if $g \colon X \dashrightarrow Y$ is a birational map.

Example 4.3. Let $[x : y : z]$ be homogeneous coordinates for the projective plane $\mathbb{P}^2_k$. Let $f$ be an element of $\mathrm{Bir}(\mathbb{P}^2_k)$; there are three homogeneous polynomials $P$, $Q$, and $R \in k[x, y, z]$ of the same degree $d$, and without common factor of degree $\ge 1$, such that

$f[x : y : z] = [P : Q : R].$

The degree $d$ is called the degree of $f$, and is denoted by $\deg(f)$. The action $f^*$ of $f$ on $N^1(\mathbb{P}^2_k)$ is the multiplication by $\deg(f)$. The dynamical degree of $f$ is thus equal to the limit of $\deg(f^n)^{1/n}$.

² One can prove this by considering the factorization of $f$ as a sequence of blow-ups followed by a sequence of blow-downs. Any curve in $\mathrm{Exc}(f)$ corresponds to a $(-1)$-curve at some point in the sequence of blow-downs, but is also the strict transform of a curve of positive self-intersection from the source (this is where we use $\rho(X) = 1$).
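
The limit of $\deg(f^n)^{1/n}$ can be observed on a family that is not discussed in the text: monomial maps. The sketch below assumes the standard facts that the map $(x, y) \mapsto (x^a y^b, x^c y^d)$ with non-negative integer exponents and determinant $ad - bc = \pm 1$ is birational, that its $n$-th iterate is the monomial map of the $n$-th power of the exponent matrix, and that the degree of such a map on $\mathbb{P}^2$ is the largest row sum of its matrix. All names are hypothetical.

```python
def mat_mul(a, b):
    """Product of two 2 x 2 integer matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def degree_of_iterate(exponents, n):
    """Degree of the n-th iterate (n >= 1) of the monomial map with the
    given non-negative exponent matrix: the largest row sum of its n-th power."""
    power = [[1, 0], [0, 1]]
    for _ in range(n):
        power = mat_mul(power, exponents)
    return max(sum(row) for row in power)


def dynamical_degree_estimate(exponents, n):
    """The n-th term deg(f^n)^(1/n) of the sequence defining lambda(f)."""
    return degree_of_iterate(exponents, n) ** (1.0 / n)
```

For the matrix `[[2, 1], [1, 1]]`, that is for $(x, y) \mapsto (x^2 y, xy)$, the degrees of the first three iterates are $3$, $8$ and $21$, and the estimates approach the spectral radius $(3 + \sqrt{5})/2$ of the matrix.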

4.4. Picard-Manin classes. We follow the presentation given in [4, 6, 22] 
which, in turn, is inspired by the fifth chapter of Manin's book [35]. 

4.4.1. Models and Picard-Manin space. A model of P^ is a smooth projective 
surface X with a birational morphism X — > P^. Let B be the set of all models of 
P£. If it: X — > P£, and ir' : X' — > P£ are two models, we say that X' dominates 
X if the induced birational map 7r _1 on' : X' — » X is a morphism. In this case, 
7T _1 ott' contracts a finite number of exceptional divisors and induces an injective 
map (vr- 1 o tt')* : ^(X) n ^ IM^Oh- 

Let Bx C B be the set of models that dominate X. Note that if X±, X2 £ Bx, 
by resolving the indeterminacies of the induced birational map X\ — * X2 we 
obtain X3 £ Bx which dominates both X\ and Xi- The space Z(X) of (finite) 
Picard-Manin classes is the direct limit 

Z(X) = lim N^XOr. 

The Néron-Severi group $N^1(X')_{\mathbb{R}}$ of any model $X' \to X$ embeds in $\mathcal{Z}(X)$ and
can be identified with its image in $\mathcal{Z}(X)$. Thus, a Picard-Manin class is just a
(real) Néron-Severi class of some model dominating $X$. The Picard-Manin class
of a divisor $D$ is still denoted by $[D]$, as for Néron-Severi classes. Note that $\mathcal{Z}(X)$
contains the direct limit of the lattices $N^1(X')$ (with integer coefficients). This
provides an integral structure for $\mathcal{Z}(X)$. In the following paragraph we construct
a basis of $\mathcal{Z}(X)$ made of integral points.

For all birational morphisms $\pi$, the pull-back operator $\pi^*$ preserves the
intersection form and maps nef classes to nef classes; as a consequence, the limit
space $\mathcal{Z}(X)$ is endowed with an intersection form (of signature $(1, \infty)$) and a nef
cone.

4.4.2. Basis of $\mathcal{Z}(X)$. A basis of the real vector space $\mathcal{Z}(X)$ is constructed as
follows. On the set of models $\pi\colon Y \to X$ together with marked points $p \in Y$,
we introduce the equivalence relation: $(p, Y) \sim (p', Y')$ if the induced birational
map $\pi'^{-1} \circ \pi\colon Y \dashrightarrow Y'$ is an isomorphism in a neighborhood of $p$ that maps
$p$ onto $p'$. Let $\mathcal{V}_X$ be the quotient space; to denote points of $\mathcal{V}_X$ we just write
$p \in \mathcal{V}_X$, without any further reference to a model $\pi\colon Y \to X$ with $p \in Y$.

Let $(p, Y)$ be an element of $\mathcal{V}_X$. Consider $\tilde{Y} \to Y$ the blow-up of $p$ and $E_p \subset \tilde{Y}$
the exceptional divisor; the Néron-Severi class $[E_p]$ determines a Picard-Manin
class and one easily verifies that this class depends only on the class $p \in \mathcal{V}_X$ (not
on the model $(Y, \pi)$, see [35]). The classes $[E_p]$, $p \in \mathcal{V}_X$, have self-intersection
$-1$, are mutually orthogonal, and are orthogonal to $N^1(X)_{\mathbb{R}}$. Moreover,

$$\mathcal{Z}(X) = N^1(X)_{\mathbb{R}} \oplus \mathrm{Vect}([E_p];\ p \in \mathcal{V}_X)$$

and this sum is orthogonal with respect to the intersection form on $\mathcal{Z}(X)$. To
get a basis of $\mathcal{Z}(X)$, we then fix a basis $([H_i])_{1 \leq i \leq \rho(X)}$ of $N^1(X)_{\mathbb{R}}$, where the $H_i$
are Cartier divisors, and complete it with the family $([E_p])_{p \in \mathcal{V}_X}$.

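4.4.3. Completion. The (completed) Picard-Manin space $\bar{\mathcal{Z}}(X)$ of $X$ is the
$L^2$-completion of $\mathcal{Z}(X)$ (see [4, 6] for details); in other words

$$\bar{\mathcal{Z}}(X) = \Big\{ [D] + \sum_{p \in \mathcal{V}_X} a_p [E_p] \ ;\ [D] \in N^1(X)_{\mathbb{R}},\ a_p \in \mathbb{R} \text{ and } \sum a_p^2 < +\infty \Big\}$$

whereas $\mathcal{Z}(X)$ corresponds to the case where the $a_p$ vanish for all but a finite
number of $p \in \mathcal{V}_X$. For the projective plane $\mathbb{P}^2_{\mathbf{k}}$, the Néron-Severi group $N^1(\mathbb{P}^2_{\mathbf{k}})$
is isomorphic to $\mathbb{Z}[H]$, where $H$ is a line; elements of $\bar{\mathcal{Z}}(X)$ are then given by
sums

$$a_0 [H] + \sum_{p \in \mathcal{V}_{\mathbb{P}^2_{\mathbf{k}}}} a_p [E_p]$$

with $\sum a_p^2 < +\infty$. We shall call this space the Picard-Manin space without
further reference to $\mathbb{P}^2_{\mathbf{k}}$ or to the completion.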

4.5. Action of $\mathrm{Bir}(X)$ on $\bar{\mathcal{Z}}(X)$. If $\pi\colon X' \to X$ is a morphism, then it induces
an isomorphism $\pi^*\colon \mathcal{Z}(X) \to \mathcal{Z}(X')$. Let us describe this fact when $\pi$ is the
(inverse of the) blow-up of a point $q \in X$. In this case we have

$$N^1(X') = \pi^*(N^1(X)) \oplus \mathbb{Z}[E_q] \quad\text{and}\quad \mathcal{V}_X = \mathcal{V}_{X'} \cup \{(q, X)\}$$

where $(q, X) \in \mathcal{V}_X$ denotes the point of $\mathcal{V}_X$ given by $q \in X$. Thus the bases
of $\mathcal{Z}(X)$ and $\mathcal{Z}(X')$ are in bijection, the only difference being that $[E_q]$ is first
viewed as the class of an exceptional divisor in $\mathcal{Z}(X)$, and then as an element of
$N^1(X') \subset \mathcal{Z}(X')$; the isomorphism $\pi^*$ corresponds to this bijection of bases. Note
that $\pi^*$ extends uniquely as a continuous isomorphism $\pi^*\colon \bar{\mathcal{Z}}(X) \to \bar{\mathcal{Z}}(X')$ that
preserves the intersection form. Since any birational morphism $\pi$ is a composition
of blow-downs of exceptional curves of the first kind, this proves that $\pi^*$ acts by
isometries on both $\mathcal{Z}(X)$ and $\bar{\mathcal{Z}}(X)$.

Now consider $f\colon \mathbb{P}^2_{\mathbf{k}} \dashrightarrow \mathbb{P}^2_{\mathbf{k}}$ a birational map. There exist a surface $X$ and
two morphisms $\pi\colon X \to \mathbb{P}^2_{\mathbf{k}}$, $\sigma\colon X \to \mathbb{P}^2_{\mathbf{k}}$, such that $f = \sigma \circ \pi^{-1}$. Defining $f^*$ by
$f^* = (\pi^*)^{-1} \circ \sigma^*$, and $f_*$ by $f_* = (f^{-1})^*$, we get a representation
of the Cremona group in the orthogonal group of $\mathcal{Z}(\mathbb{P}^2_{\mathbf{k}})$ (resp. $\bar{\mathcal{Z}}(\mathbb{P}^2_{\mathbf{k}})$) with
respect to the intersection form. This representation is faithful, because $f_*[E_p] =
[E_{f(p)}]$ for all points $p$ in the domain of definition of $f$; it preserves the integral
structure of $\mathcal{Z}(X)$ and the nef cone.

4.6. Action on an infinite dimensional hyperbolic space. 

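4.6.1. The hyperbolic space $\mathbb{H}_{\bar{\mathcal{Z}}}$. We define

$$\mathbb{H}_{\bar{\mathcal{Z}}} = \{ [D] \in \bar{\mathcal{Z}} ;\ [D]^2 = 1,\ [H] \cdot [D] > 0 \}$$

and a distance dist on $\mathbb{H}_{\bar{\mathcal{Z}}}$ by the following formula

$$\cosh(\mathrm{dist}([D_1], [D_2])) = [D_1] \cdot [D_2].$$

Since the intersection form is of Minkowski type, this gives to $\mathbb{H}_{\bar{\mathcal{Z}}}$ the structure
of an infinite dimensional hyperbolic space, as in §3.1.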
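The distance formula can be tried out on finite Picard-Manin classes. The following is a minimal sketch, not part of the original text: it assumes a class $a[H] + \sum b_i [E_i]$ is stored as the tuple `(a, b_1, ..., b_n)`, and the helper names `intersect` and `dist` are hypothetical.

```python
import math

def intersect(u, v):
    """Intersection form of signature (1, n) on tuples (a, b_1, ..., b_n).

    The first coordinate is the coefficient of [H] (with [H]^2 = 1), the
    others are coefficients of classes [E_i] (with [E_i]^2 = -1).
    """
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

def dist(u, v):
    """Hyperbolic distance given by cosh(dist) = u . v when u^2 = v^2 = 1."""
    # Clamp at 1 to absorb rounding errors when u and v coincide.
    return math.acosh(max(1.0, intersect(u, v)))

# [H], and the class 2[H] - [E_1] - [E_2] - [E_3], of self-intersection 4 - 3 = 1.
H = (1, 0, 0, 0)
D = (2, -1, -1, -1)
```

Here `intersect(H, D)` equals 2, so `dist(H, D)` is `acosh(2)`; this is the distance by which a quadratic map moves the class of a line.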

4.6.2. Cremona isometries. By paragraph 4.5 the action of the Cremona group
on $\bar{\mathcal{Z}}$ preserves the two-sheeted hyperboloid $\{[D] \in \bar{\mathcal{Z}}(\mathbb{P}^2_{\mathbf{k}});\ [D]^2 = 1\}$ and since
the action also preserves the nef cone, we obtain a faithful representation of the
Cremona group into the group of isometries of $\mathbb{H}_{\bar{\mathcal{Z}}}$:

$$\mathrm{Bir}(\mathbb{P}^2_{\mathbf{k}}) \hookrightarrow \mathrm{Isom}(\mathbb{H}_{\bar{\mathcal{Z}}}).$$

In the context of the Cremona group, the classification of isometries into three
types (see § 3.1.4) has an algebraic-geometric meaning.

Theorem 4.4 ([6]). Let $f$ be an element of $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{k}})$. The isometry $f_*$ of $\mathbb{H}_{\bar{\mathcal{Z}}}$ is
hyperbolic if and only if the dynamical degree $\lambda(f)$ is $> 1$.



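As a consequence, when $\lambda(f) > 1$ then $f_*$ preserves a unique geodesic line
$\mathrm{Ax}(f) \subset \mathbb{H}_{\bar{\mathcal{Z}}}$; this line is the intersection of $\mathbb{H}_{\bar{\mathcal{Z}}}$ with a plane $V_f \subset \bar{\mathcal{Z}}$ which
intersects the isotropic cone of $\bar{\mathcal{Z}}$ on two lines $R_f^+$ and $R_f^-$ such that

$$f_*(a) = \lambda(f)^{\pm 1} a$$

for all $a \in R_f^{\pm}$ (the lines $R_f^+$ and $R_f^-$ correspond to $\omega(f)$ and $\alpha(f)$ in the notation
of § 3.1.4).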

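Remark 4.5. The translation length $L(f_*)$ is therefore equal to $\log(\lambda(f))$.
Indeed, take $[\alpha] \in R_f^-$, $[\omega] \in R_f^+$ normalized such that $[\alpha] \cdot [\omega] = 1$. The point
$[P] = \frac{1}{\sqrt{2}}([\alpha] + [\omega])$ is on the axis $\mathrm{Ax}(f_*)$. Since $f_*[P] = \frac{1}{\sqrt{2}}(\lambda(f)^{-1}[\alpha] + \lambda(f)[\omega])$,
we get

$$e^{L(f_*)} + \frac{1}{e^{L(f_*)}} = 2\cosh(\mathrm{dist}([P], f_*[P])) = 2([P] \cdot f_*[P]) = \lambda(f) + \frac{1}{\lambda(f)}.$$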
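The computation of Remark 4.5 can be replayed numerically. This is only an illustrative sketch: it works in the plane $V_f$ with hypothetical coordinates $(a, w)$ standing for $a[\alpha] + w[\omega]$, where $[\alpha]^2 = [\omega]^2 = 0$ and $[\alpha] \cdot [\omega] = 1$.

```python
import math

def form(u, v):
    """Intersection form on V_f in the isotropic basis ([alpha], [omega])."""
    return u[0] * v[1] + u[1] * v[0]

def translation_length(lam):
    """Distance between [P] and f_*[P] when f_* scales [alpha] by 1/lam
    and [omega] by lam."""
    s = 1 / math.sqrt(2)
    P = (s, s)                   # [P] = ([alpha] + [omega]) / sqrt(2)
    fP = (s / lam, s * lam)      # f_*[P]
    return math.acosh(form(P, fP))
```

For instance `translation_length(2.0)` agrees with `math.log(2.0)` up to rounding, since $[P] \cdot f_*[P] = (\lambda + \lambda^{-1})/2$.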
Remark 4.6. Over the field of complex numbers $\mathbb{C}$, [6] proves that: $f_*$ is elliptic
if, and only if, there exists a positive iterate $f^k$ of $f$ and a birational map
$\varepsilon\colon \mathbb{P}^2_{\mathbb{C}} \dashrightarrow X$ such that $\varepsilon \circ f^k \circ \varepsilon^{-1}$ is an element of $\mathrm{Aut}(X)^0$ (the connected
component of the identity in $\mathrm{Aut}(X)$); $f_*$ is parabolic if, and only if, $f$ preserves
a pencil of elliptic curves and $\deg(f^n)$ grows quadratically with $n$, or $f$ preserves
a pencil of rational curves and $\deg(f^n)$ grows linearly with $n$.

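4.6.3. Automorphisms. Assume that $f \in \mathrm{Bir}(\mathbb{P}^2_{\mathbf{k}})$ is conjugated, via a birational
transformation $\varphi$, to an automorphism $g$ of a smooth rational surface $X$:

$$\begin{array}{ccc}
X & \overset{g}{\longrightarrow} & X \\
\varphi \downarrow & & \downarrow \varphi \\
\mathbb{P}^2_{\mathbf{k}} & \overset{f}{\dashrightarrow} & \mathbb{P}^2_{\mathbf{k}}
\end{array}$$

Then we have an isomorphism $\varphi_*\colon \bar{\mathcal{Z}}(X) \to \bar{\mathcal{Z}}(\mathbb{P}^2_{\mathbf{k}})$ and an orthogonal
decomposition

$$\bar{\mathcal{Z}}(X) = N^1(X)_{\mathbb{R}} \oplus N^1(X)_{\mathbb{R}}^{\perp}$$

where $N^1(X)_{\mathbb{R}}^{\perp}$ is spanned by the classes $[E_p]$, $p \in \mathcal{V}_X$. This orthogonal
decomposition is $g_*$-invariant. In particular, $f_*$ preserves the finite dimensional
subspace $\varphi_* N^1(X)_{\mathbb{R}} \subset \bar{\mathcal{Z}}(\mathbb{P}^2_{\mathbf{k}})$.

By the Hodge index theorem, the intersection form has signature $(1, \rho(X) - 1)$ on
$N^1(X)$, so that $\varphi_* N^1(X)_{\mathbb{R}}$ intersects $\mathbb{H}_{\bar{\mathcal{Z}}}$ on an $f_*$-invariant hyperbolic subspace
of dimension $\rho(X) - 1$. This proves the following lemma.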

Lemma 4.7. If $f$ is conjugate to an automorphism $g \in \mathrm{Aut}(X)$ by a birational
map $\varphi\colon X \dashrightarrow \mathbb{P}^2_{\mathbf{k}}$, then:

(1) The isometry $f_*\colon \mathbb{H}_{\bar{\mathcal{Z}}} \to \mathbb{H}_{\bar{\mathcal{Z}}}$ is hyperbolic (resp. parabolic, resp. elliptic)
if and only if the isometry $g_*\colon N^1(X)_{\mathbb{R}} \to N^1(X)_{\mathbb{R}}$ is hyperbolic (resp.
parabolic, resp. elliptic) for the intersection form on $N^1(X)_{\mathbb{R}}$;

(2) the translation length of $f_*$ is equal to the translation length of $g_*$;

(3) if $f_*$ is hyperbolic then, modulo $\varphi_*$-conjugacy, the plane $V_f$ corresponds
to $V_g$, which is contained in $\mathrm{NS}(X)_{\mathbb{R}}$, and $\mathrm{Ax}(f_*)$ corresponds to $\mathrm{Ax}(g_*)$.

4.6.4. Example: quadratic mappings (see [8]). The set of birational quadratic
maps $\mathrm{Bir}_2(\mathbb{P}^2_{\mathbb{C}})$ is an irreducible algebraic variety of dimension 14. Let $f\colon \mathbb{P}^2_{\mathbb{C}} \dashrightarrow
\mathbb{P}^2_{\mathbb{C}}$ be a quadratic birational map. The base locus of $f$ (resp. $f^{-1}$) is made of
three points $p_1, p_2, p_3$ (resp. $q_1, q_2, q_3$), where infinitely near points are allowed.
We have

$$f_*([H]) = 2[H] - [E_{q_1}] - [E_{q_2}] - [E_{q_3}]$$

with $[H]$ the class of a line in $\mathbb{P}^2_{\mathbb{C}}$. If $f$ is an isomorphism on a neighborhood of
$p$, and $f(p) = q$, then $f_*([E_p]) = [E_q]$. Note that these formulas are correct even
when there are some infinitely near base points. For instance if $f$ is the Hénon
map

$$[x : y : z] \mapsto [yz : y^2 - xz : z^2]$$

then $q_2$ is infinitely near $q_1$, and $q_3$ is infinitely near $q_2$, but nevertheless the
formula for the image of $H$ is still the same.

A Zariski open subset of $\mathrm{Bir}_2(\mathbb{P}^2_{\mathbb{C}})$ is made of birational transformations $f =
h_1 \circ \sigma \circ h_2$, with $h_i \in \mathrm{Aut}(\mathbb{P}^2_{\mathbb{C}})$, $i = 1, 2$, and $\sigma$ the standard quadratic involution
$\sigma[x : y : z] = [yz : zx : xy]$. For such maps, $\{p_1, p_2, p_3\}$ is the image of
$\mathrm{Ind}(\sigma) = \{[1 : 0 : 0]; [0 : 1 : 0]; [0 : 0 : 1]\}$ by $h_2^{-1}$, and $\{q_1, q_2, q_3\} = h_1(\mathrm{Ind}(\sigma))$.
Moreover, the exceptional set $\mathrm{Exc}(f^{-1})$ is the union of the three lines through
the pairs of points $(q_i, q_j)$, $i \neq j$. Assume, for example, that the line through $q_1$
and $q_2$ is contracted on $p_1$ by $f^{-1}$, then we have

M[E P1 ]) = [H] - [E q2 ] - [E^]. 

If $f$ is a general quadratic map, then $\lambda(f)$ is equal to 2 and $f_*$ induces a hyperbolic
isometry on $\mathbb{H}_{\bar{\mathcal{Z}}}$; we shall see that it is possible to compute explicitly the points
on the axis of $f_*$ (see §5.1.4 for precise statements) and that $[H]$ is not on the
axis. In fact, in this situation the axis of $f_*$ does not contain any finite class
$[D] \in \mathcal{Z}(\mathbb{P}^2_{\mathbb{C}})$.
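For the standard involution $\sigma$ itself, these formulas can be checked on the four-dimensional span of $[H], [E_1], [E_2], [E_3]$. The sketch below is an illustration under a stated labeling assumption: the three base points of $\sigma$ are numbered so that $p_i = q_i$ and the line through the two points other than $p_i$ is contracted onto $p_i$. The function names are hypothetical.

```python
def intersect(u, v):
    """Intersection form on tuples (a, b1, b2, b3) = a[H] + sum b_i [E_i]."""
    return u[0] * v[0] - sum(x * y for x, y in zip(u[1:], v[1:]))

def sigma_star(v):
    """Action of sigma_* determined by
    sigma_*[H]   = 2[H] - [E_1] - [E_2] - [E_3],
    sigma_*[E_1] =  [H] - [E_2] - [E_3], and cyclically."""
    a, b1, b2, b3 = v
    return (2 * a + b1 + b2 + b3,
            -a - b2 - b3,
            -a - b1 - b3,
            -a - b1 - b2)

BASIS = [(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)]
```

On this basis `sigma_star` is an involution that preserves the intersection form, and `intersect((1, 0, 0, 0), sigma_star((1, 0, 0, 0)))` recovers the degree 2.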

5. Tight birational maps 

5.1. General Cremona transformations. In this section we prove Theo- 
rem A concerning normal subgroups generated by iterates of general Cremona 
transformations. 

5.1.1. De Jonquières transformations. Let $d$ be a positive integer. As mentioned
in the introduction, the set $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$ of plane birational transformations of degree
$d$ is quasi-projective: It is a Zariski open subset in a subvariety of the projective
space made of triples of homogeneous polynomials modulo scalar multiplication.
Recall that $J_d$ denotes the set of de Jonquières transformations of degree $d$,
defined as the birational transformations of degree $d$ of $\mathbb{P}^2_{\mathbb{C}}$ that preserve the
pencil of lines through $q_0 = [1 : 0 : 0]$. Then we define $V_d$ as the image of the
composition map

$$(h_1, f, h_2) \mapsto h_1 \circ f \circ h_2$$

where $(h_1, f, h_2)$ describes $\mathrm{PGL}_3(\mathbb{C}) \times J_d \times \mathrm{PGL}_3(\mathbb{C})$. As the image of an
irreducible algebraic set by a regular map, $V_d$ is an irreducible subvariety of $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$.
The dimension of $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$ is equal to $4d + 6$ and $V_d$ is the unique irreducible
component of $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$ of maximal dimension (in that sense, generic elements of
$\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$ are contained in $V_d$). In degree 2, i.e. for quadratic Cremona
transformations, $V_2$ coincides with a Zariski open subset of $\mathrm{Bir}_2(\mathbb{P}^2_{\mathbb{C}})$.
Let $f$ be an element of $J_d$. In affine coordinates,

$$f(x, y) = (B_y(x), A(y))$$

where $A$ is in $\mathrm{PGL}_2(\mathbb{C})$ and $B_y$ in $\mathrm{PGL}_2(\mathbb{C}(y))$. Clearing denominators we can
assume that $B_y$ is given by a function $B\colon y \mapsto B(y)$ with:

where the coefficients $a$, $c$ and $b$, $d$ are polynomials of degree $d - 1$ and $d$
respectively. The degree of the function $\det(B(y))$ is equal to $2d - 2$; if $B$ is generic,
$\det(B(y))$ has $2d - 2$ roots $y_i$, $1 \leq i \leq 2d - 2$, and $B(y_i)$ is a rank 1 complex
matrix for each of these roots. The image of $B(y_i)$ is a line, and this line
corresponds to a point $x_i$ in $\mathbb{P}(\mathbb{C}^2)$. The birational transformation $f$ contracts each
horizontal line corresponding to a root $y_i$ onto a point $q_i = (x_i, A(y_i))$. This
provides $2d - 2$ points of indeterminacy for $f^{-1}$; again, if $B$ is generic, the $2d - 2$
points $q_i$ are distinct, generic points of the plane. The same conclusion holds
if we change $f$ into its inverse, and gives rise to $2d - 2$ indeterminacy points
$p_1, \ldots, p_{2d-2}$ for $f$. One more indeterminacy point (for $f$ and $f^{-1}$) coincides
with $p_0 = q_0 = [1 : 0 : 0]$.

An easy computation shows that the base locus of $f$ is made of

(1) the point $p_0$ itself, with multiplicity $d - 1$;

(2) the $2d - 2$ single points $p_1, \ldots, p_{2d-2}$.

Moreover, all sets of distinct points $\{p_0, p_1, \ldots, p_{2d-2}\}$ with $p_0 = [1 : 0 : 0]$ and
no three of them on a line through $p_0$ can be obtained as the indeterminacy set
of a de Jonquières transformation of degree $d$. In particular, on the complement
of a strict Zariski closed subset of $J_d$, the points $p_i$ form a set of $2d - 1$ distinct
points in the plane: There are no infinitely near points in the list. Thus, for a
generic element of $V_d$, we have

$$f_*[H] = d[H] - (d - 1)[E_{p_0}] - \sum_{i=1}^{2d-2} [E_{p_i}]$$

where the $p_i$ are generic distinct points of the plane.

Remark 5.1. If $\Sigma \subset \mathbb{P}^2_{\mathbb{C}}$ is a generic set of cardinal $k$, and $h$ is an automorphism
of $\mathbb{P}^2_{\mathbb{C}}$, then $h$ is the identity map as soon as $h(\Sigma) \cap \Sigma$ contains five points. Applied
to $\mathrm{Ind}(f)$, we obtain the following: Let $g$ be a generic element of $V_d$, and $h$ be
an automorphism of $\mathbb{P}^2_{\mathbb{C}}$; if $h$ is not the identity map, then $h(\mathrm{Ind}(g)) \cap \mathrm{Ind}(g)$
contains at most four points.
5.1.2. Tightness is a general property in $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$. Our goal is to prove the
following statement.

Theorem 5.2. There exists a positive integer $k$ such that for all integers $d \geq 2$
the following properties are satisfied by a general element $g \in V_d$:

(1) $g$ is a tight Cremona transformation;

(2) If $n \geq k$, then $g^n$ generates a normal subgroup of $\mathrm{Bir}(\mathbb{P}^2_{\mathbb{C}})$ whose non
trivial elements $f$ satisfy $\lambda(f) \geq d^n$.

Since $V_d$ is irreducible, tight Cremona transformations are dense in $V_d$; since
$V_d$ is the unique component of $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$ of maximal dimension, properties (1) and
(2) are also generally satisfied in $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$. Thus Theorem 5.2 implies Theorem A.

5.1.3. Strong algebraic stability is a general property. Given a surface $X$, a
birational transformation $f \in \mathrm{Bir}(X)$ is said to be algebraically stable if one of
the following equivalent properties holds:

(1) if $x$ is a point of $\mathrm{Ind}(f)$, then $f^k(x) \notin \mathrm{Ind}(f^{-1})$ for all $k \leq 0$;

(2) if $y$ is a point of $\mathrm{Ind}(f^{-1})$ then $f^k(y) \notin \mathrm{Ind}(f)$ for all $k \geq 0$;

(3) the action $f^*$ of $f$ on the Néron-Severi group $N^1(X)$ satisfies $(f^k)^* =
(f^*)^k$ for all $k \in \mathbb{Z}$.

If $f$ is an element of $\mathrm{Bir}(\mathbb{P}^2_{\mathbb{C}})$, algebraic stability is equivalent to $\lambda(f) = \deg(f)$
(if $f$ is not algebraically stable, property (3) implies $\lambda(f) < \deg(f)$). Condition
(2) can be rephrased by saying that $f$ is algebraically stable if

$$f^k(\mathrm{Ind}(f^{-1})) \cap \mathrm{Ind}(f) = \emptyset$$

for all $k \geq 0$. We now prove that general elements of $V_d$ satisfy a property which is
stronger than algebraic stability (recall from Remark 4.2 that $\mathrm{Ind}(f) \subset \mathrm{Exc}(f)$).

Lemma 5.3. If $g$ is a general element of $V_d$ then

$$g^k(\mathrm{Ind}(g^{-1})) \cap \mathrm{Exc}(g) = \emptyset \qquad (5.1)$$

for all $k \geq 0$. In particular $g$ is algebraically stable.

Proof. For a fixed integer $k \geq 0$, condition (5.1) fails if and only if there exists
a point $m \in \mathrm{Ind}(g^{-1})$ such that $g^k(m) \in \mathrm{Exc}(g)$, that is, such that $\mathrm{Jac}_g(g^k(m)) = 0$,
where $\mathrm{Jac}_g$, the Jacobian determinant of $g$, is the equation of $\mathrm{Exc}(g)$. This is an
algebraic condition, that defines an algebraic subvariety $I_k$ in $\mathrm{Bir}_d(\mathbb{P}^2_{\mathbb{C}})$.

Fact 5.4. There exists $g \in V_d$ such that $g \notin I_k$ for any $k \geq 0$.

Since $V_d$ is irreducible, this fact implies that the codimension of $I_k$ is positive
and shows that equation (5.1) is satisfied on the intersection of countably many
Zariski dense open subsets of $V_d$. We are therefore reduced to proving Fact 5.4.
For this, let $f$ be the Hénon mapping defined by

$$f([x : y : z]) = \left[ \frac{yz^{d-1}}{10} : y^d + xz^{d-1} : z^d \right].$$

The affine plane $\{z \neq 0\}$ is $f$-invariant, and $f$ restricts to a polynomial
automorphism of this plane. The exceptional set $\mathrm{Exc}(f)$ is the line at infinity $\{z = 0\}$,
and its image $\mathrm{Ind}(f^{-1})$ is the point $q = [0 : 1 : 0]$. This point is fixed by $f$: In
the affine chart $\{y \neq 0\}$, with affine coordinates $(x, z) = [x : 1 : z]$ around $q$, the
map $f$ is given by

$$(x, z) \mapsto \left( \frac{z^{d-1}}{10(1 + xz^{d-1})},\ \frac{z^d}{1 + xz^{d-1}} \right).$$

In particular, $q$ is an attracting fixed point. Let $U$ be the neighborhood of $q$
defined by

$$U = \{[x : 1 : z] ;\ |x| < 1/10,\ |z| < 1/10\}.$$

Then, $f(U)$ is contained in $\{[x : 1 : z] ;\ |x| < 1/90,\ |z| < 1/90\}$. Let $h$ be the
linear transformation of the plane which, in the affine coordinates $(x, z)$, is the
translation $(x, z) \mapsto (x + 1/20, z + 1/20)$. We have $h(f(U)) \subset U \setminus \{z = 0\}$, and
$f(U) \cap h(f(U)) = \emptyset$. We now take $g = h \circ f$. Its exceptional set is the line at
infinity $\{z = 0\}$; the unique indeterminacy point of $g^{-1}$ is $h(q) = [1/20 : 1 : 1/20]$.
By construction, for all $k \geq 0$, $g^k(\mathrm{Ind}(g^{-1}))$ is a point of $U \setminus \mathrm{Exc}(g)$. This proves
that $g$ is not in $I_k$, for any $k \geq 0$. □
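The construction of $g = h \circ f$ can be explored numerically. The sketch below is a sanity check on finitely many iterates, not a proof. It assumes that, in the chart $(x, z) = [x : 1 : z]$, the map $f$ reads $(x, z) \mapsto \big(z^{d-1}/(10(1 + xz^{d-1})),\ z^d/(1 + xz^{d-1})\big)$, and the function names are hypothetical.

```python
def f_local(x, z, d):
    """Assumed local expression of the Henon map f near q = [0:1:0]."""
    w = 1 + x * z ** (d - 1)
    return z ** (d - 1) / (10 * w), z ** d / w

def g_local(x, z, d):
    """g = h o f, where h is the translation by (1/20, 1/20) in this chart."""
    fx, fz = f_local(x, z, d)
    return fx + 1 / 20, fz + 1 / 20

def orbit_stays_in_U(d, steps=50):
    """Follow the orbit of h(q) = (1/20, 1/20), the indeterminacy point of
    g^{-1}, and check that it stays in U and off the line {z = 0}."""
    x, z = 1 / 20, 1 / 20
    for _ in range(steps):
        if not (abs(x) < 1 / 10 and abs(z) < 1 / 10 and z != 0):
            return False
        x, z = g_local(x, z, d)
    return True
```

Calling `orbit_stays_in_U(2)` or `orbit_stays_in_U(5)` follows the first fifty points of the forward orbit of $\mathrm{Ind}(g^{-1})$ and returns `True` in both cases.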

We say that a Cremona transformation $g$ is strongly algebraically stable
if $g$ and $g^{-1}$ satisfy property (5.1) stated in Lemma 5.3. This lemma shows that
general elements of $V_d$ are strongly algebraically stable.

Recall that algebraic stability implies that $g$ is well defined along the forward
orbit $g^k(\mathrm{Ind}(g^{-1}))$, $k \geq 0$; similarly, $g^{-1}$ is well defined along the backward orbit
of $\mathrm{Ind}(g)$.

Lemma 5.5. Let $g$ be strongly algebraically stable. Then we have:

(1) For all $k > j \geq 0$, $g^k(\mathrm{Ind}(g^{-1})) \cap g^j(\mathrm{Ind}(g^{-1})) = \emptyset$;

(2) For all $k > j \geq 0$, $g^{-k}(\mathrm{Ind}(g)) \cap g^{-j}(\mathrm{Ind}(g)) = \emptyset$;

(3) For all $k \geq 0$ and $j \geq 0$, $g^k(\mathrm{Ind}(g^{-1})) \cap g^{-j}(\mathrm{Ind}(g)) = \emptyset$.

Proof. Suppose that there is a point $q \in g^k(\mathrm{Ind}(g^{-1})) \cap g^j(\mathrm{Ind}(g^{-1}))$. Suppose
first that $j = 0$. This means that there exist $p, q \in \mathrm{Ind}(g^{-1})$ such that $g^k(p) = q$.
But then $g^{k-1}(p) \in \mathrm{Exc}(g)$, contradicting the assumption. Now assume $j > 0$.
By the first step, we know that $g^{-1}$ is well defined along the positive orbit of
$\mathrm{Ind}(g^{-1})$. Thus we can apply $g^{-j}$, which brings us back to the case $j = 0$.
Property (2) is proven similarly, replacing $g$ by $g^{-1}$.
Suppose (3) is false. Then there exist $p \in \mathrm{Ind}(g^{-1})$, $q \in \mathrm{Ind}(g)$, $k, j \geq 0$
such that $g^k(p) = g^{-j}(q)$. By (2) we can apply $g^j$ to the right hand side of
this equality, thus $q \in g^{k+j}(\mathrm{Ind}(g^{-1})) \cap \mathrm{Ind}(g)$. This contradicts the algebraic
stability of $g$. □

5.1.4. Rigidity is a general property. Let $g$ be an element of $V_d$. Consider the
isometry $g_*$ of $\bar{\mathcal{Z}}$. If $g$ is algebraically stable, its dynamical degree is equal to $d$
and the translation length of $g_*$ is equal to $\log(d)$. The Picard-Manin classes $[\alpha]$
and $[\omega]$ corresponding to the end points of the axis of $g_*$ satisfy $g_*[\alpha] = [\alpha]/d$
and $g_*[\omega] = d[\omega]$. To compute explicitly such classes, we start with the class $[H]$
of a line in $\mathbb{P}^2_{\mathbb{C}}$, and use the fact that

$$\frac{1}{d^n} g_*^n [H] \longrightarrow \mathrm{const} \cdot [\omega]$$

when $n$ goes to $+\infty$ (see § 4.6.2).

Assuming that $g$ is a general element of $V_d$, its base locus is made of one point
$p_0$ of multiplicity $d - 1$ and $2d - 2$ points $p_i$, $1 \leq i \leq 2d - 2$, of multiplicity 1, and
similarly for the base locus of $g^{-1}$. Denote by $[E^+]$ (resp. $[E^-]$) the sum of the classes
of the exceptional divisors, with multiplicity $d - 1$ for the first one, obtained by
blowing-up the $2d - 1$ distinct points in $\mathrm{Ind}(g)$ (resp. $\mathrm{Ind}(g^{-1})$). We have

$$g_*[H] = d[H] - [E^-], \quad g_*^2[H] = d^2[H] - d[E^-] - g_*[E^-], \quad \text{etc.}$$

Thus, the lines $R^-$ and $R^+$ of the Picard-Manin space generated by

$$[\alpha] = [H] - \sum_{i=1}^{\infty} \frac{g_*^{-i+1}[E^+]}{d^i} \quad\text{and}\quad [\omega] = [H] - \sum_{i=1}^{\infty} \frac{g_*^{i-1}[E^-]}{d^i}$$

correspond to the end points of the axis of $g_*$. By Lemma 5.5, both infinite
sums appearing in these formulas are sums of classes of the exceptional divisors
obtained by blowing up the backward (resp. forward) orbit of $\mathrm{Ind}(g)$ (resp.
$\mathrm{Ind}(g^{-1})$). By construction, $[\alpha]$ and $[\omega]$ satisfy $[\alpha] \cdot [\omega] = 1$ and $[\alpha]^2 = [\omega]^2 = 0$,
because both of them are on the boundary of $\mathbb{H}_{\bar{\mathcal{Z}}}$. All points on $\mathrm{Ax}(g_*)$ are linear
combinations $u[\alpha] + v[\omega]$ with the condition

$$1 = (u[\alpha] + v[\omega])^2 = 2uv.$$
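The vanishing of $[\omega]^2$ can be checked on partial sums. The sketch below reads the class $[\omega]$ as $[H] - \sum_{i \geq 1} g_*^{i-1}[E^-]/d^i$ and assumes, as Lemma 5.5 provides, that the classes $g_*^{i-1}[E^-]$ are mutually orthogonal and orthogonal to $[H]$; since $g_*$ is an isometry, each has self-intersection $-((d-1)^2 + (2d-2)) = -(d^2 - 1)$. The function names are hypothetical.

```python
from fractions import Fraction

def E_minus_square(d):
    """Self-intersection of [E^-]: one class with multiplicity d - 1 and
    2d - 2 classes with multiplicity 1, each of square -1."""
    return -((d - 1) ** 2 + (2 * d - 2))

def omega_square_partial(d, n):
    """Self-intersection of [H] - sum_{i=1}^n g_*^{i-1}[E^-] / d^i under the
    orthogonality assumptions stated above."""
    total = Fraction(1)  # [H]^2 = 1
    for i in range(1, n + 1):
        total += Fraction(E_minus_square(d), d ** (2 * i))
    return total
```

The partial sums equal $d^{-2n}$ exactly, so they decrease to 0 as $n$ grows; the same computation applies to $[\alpha]$ with $[E^+]$.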

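The intersection of $[H]$ with a point on $\mathrm{Ax}(g_*)$ is minimal for $u = v = \frac{1}{\sqrt{2}}$ and
is then equal to $\sqrt{2}$ (independently of $d$); denote by $[P] = \frac{1}{\sqrt{2}}([\alpha] + [\omega])$ the point
which realizes the minimum. We have

$$[P] = \sqrt{2}[H] - \frac{1}{\sqrt{2}}[R] \quad\text{with}\quad [R] = \sum_{i=1}^{\infty} \frac{g_*^{-i+1}[E^+] + g_*^{i-1}[E^-]}{d^i}.$$

Once again, Lemma 5.5 implies the following fact.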
Fact 5.6. The class $[R]$ is a sum of classes of exceptional divisors obtained by
blowing up distinct points of $\mathbb{P}^2_{\mathbb{C}}$ (there is no blow-up of infinitely near points).



Proposition 5.7. Let $\varepsilon_0 = 0.289$. Let $d \geq 2$ be an integer and $g$ be a general
element of $V_d$. Let $[P]$ be the Picard-Manin class defined above. If $f$ is a
birational transformation of the plane such that $\mathrm{dist}(f_*[Q], [Q]) < \varepsilon_0$ for all $[Q]$ in
$\{g_*^{-1}[P], [P], g_*[P]\}$, then $f$ is the identity map.

The proof uses explicit values for distances and hyperbolic cosines that are
recorded in Table 1. Using this table, we see that

$$\cosh(a_1 + \varepsilon) < 4 \quad\text{and}\quad a_2 + \varepsilon < a_4 < a_3$$

as soon as $\varepsilon \leq \varepsilon_0 = 0.289$.



    $i$    $\cosh(a_i)$         $a_i$
    1      $3$                  $a_1 \simeq 1.76274$
    2      $\sqrt{2}$           $a_2 \simeq 0.88137 = a_1/2$
    3      $3/\sqrt{2}$         $a_3 \simeq 1.38432$
    4      $5/(2\sqrt{2})$      $a_4 \simeq 1.17108$
    5      $4$                  $a_5 \simeq 2.06343$

Table 1. Distances and hyperbolic cosines
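The entries of Table 1 and the two inequalities used in the proof can be recomputed directly. This is a small numerical sketch; the names `COSH`, `A`, `EPS0` and `inequalities_hold` are introduced here for illustration only.

```python
import math

# Hyperbolic cosines listed in Table 1, indexed by i = 1, ..., 5.
COSH = {
    1: 3.0,
    2: math.sqrt(2),
    3: 3 / math.sqrt(2),
    4: 5 / (2 * math.sqrt(2)),
    5: 4.0,
}

# Corresponding distances a_i = arccosh(cosh(a_i)).
A = {i: math.acosh(c) for i, c in COSH.items()}

EPS0 = 0.289

def inequalities_hold(eps):
    """Check cosh(a_1 + eps) < 4 and a_2 + eps < a_4 < a_3."""
    return math.cosh(A[1] + eps) < 4 and A[2] + eps < A[4] < A[3]
```

The check succeeds for `EPS0` and already fails for `eps = 0.29`, because $a_2 + 0.29$ exceeds $a_4$; the value 0.289 is therefore close to the largest admissible constant for this argument.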



Proof. We proceed in two steps.

First step. We show that if $\mathrm{dist}(f_*[P], [P]) < \varepsilon_0$, then $f$ is linear.

By the triangular inequality

$$\begin{aligned}
\mathrm{dist}(f_*[H], [H]) &\leq \mathrm{dist}(f_*[H], f_*[P]) + \mathrm{dist}(f_*[P], [P]) + \mathrm{dist}([P], [H]) \\
&< 2\,\mathrm{dist}([P], [H]) + \varepsilon_0.
\end{aligned}$$

Recall that $\cosh(\mathrm{dist}([D_1], [D_2])) = [D_1] \cdot [D_2]$ for all pairs of points $[D_1]$, $[D_2]$
in $\mathbb{H}_{\bar{\mathcal{Z}}}$ and that the degree of $f$ is given by $\deg(f) = f_*[H] \cdot [H]$. Using Table 1
we see that $[P] \cdot [H] = \sqrt{2}$ implies $2\,\mathrm{dist}([P], [H]) = 2a_2 = a_1$. Taking hyperbolic
cosines we get

$$\deg(f) < \cosh(a_1 + \varepsilon_0) < 4.$$

We conclude that $\deg(f) \leq 3$.

Now we want to exclude the cases $\deg(f) = 3$ or 2. We will use the following
remark twice: Since $[H] \cdot [P] = \sqrt{2}$, we have $\mathrm{dist}([H], [P]) = a_2$; thus, if $f_*$
is an isometry and $\mathrm{dist}(f_*[P], [P]) < \varepsilon_0$, applying hyperbolic cosines to the
triangular inequality $\mathrm{dist}(f_*[H], [P]) \leq \mathrm{dist}(f_*[H], f_*[P]) + \mathrm{dist}(f_*[P], [P])$, we get

$$f_*[H] \cdot [P] < \cosh(a_2 + \varepsilon_0).$$

Suppose that $\deg(f) = 3$. Then

$$f_*[H] = 3[H] - 2[E_1] - [E_2] - [E_3] - [E_4] - [E_5]$$

for some exceptional divisors $E_i$ above $\mathbb{P}^2_{\mathbb{C}}$ (they may come from blow-ups of
infinitely near points). By Fact 5.6, all exceptional classes in the infinite sum
defining $[R]$ come from blow-ups of distinct points; hence

$$[2E_1 + E_2 + E_3 + E_4 + E_5] \cdot [R] \geq \frac{1}{d}\big(-2(d - 1) - 1 - 1 - 1 - 1\big) \geq -3.$$

Consider $[D]$ the point of the Picard-Manin space such that $[P]$ is the middle
point of the geodesic segment from $[H]$ to $[D]$; explicitly

$$[D] = 2\sqrt{2}[P] - [H] = 3[H] - 2[R].$$

We obtain

$$\begin{aligned}
f_*[H] \cdot [D] &= f_*[H] \cdot (3[H] - 2[R]) \\
&= 9 + 2[2E_1 + E_2 + E_3 + E_4 + E_5] \cdot [R] \\
&\geq 3.
\end{aligned}$$

On the other hand, $f_*[H] \cdot [H] = 3$ because $f$ has degree 3. Since $2\sqrt{2}[P] =
[H] + [D]$ we obtain

$$\cosh(a_2 + \varepsilon_0) > f_*[H] \cdot [P] \geq \frac{3 + 3}{2\sqrt{2}} = \frac{3}{\sqrt{2}} = \cosh(a_3).$$

This contradicts the choice of $\varepsilon_0$.

Suppose that $\deg(f) = 2$. We have $f_*[H] = 2[H] - [E_1] - [E_2] - [E_3]$ where
the $[E_i]$ are classes of exceptional divisors above $\mathbb{P}^2_{\mathbb{C}}$. The product $f_*[H] \cdot [P]$ is
given by

$$\begin{aligned}
f_*[H] \cdot [P] &= [2H - E_1 - E_2 - E_3] \cdot \Big(\sqrt{2}[H] - \frac{1}{\sqrt{2}}[R]\Big) \\
&= 2\sqrt{2} + \frac{1}{\sqrt{2}}[E_1 + E_2 + E_3] \cdot [R] \\
&\geq \frac{5}{2\sqrt{2}}
\end{aligned}$$

where the inequality follows from Fact 5.6: The curves $E_1$, $E_2$, and $E_3$ appear at
most once in $[R]$, so that $[E_1 + E_2 + E_3] \cdot [R] \geq -\frac{3}{2}$. As a consequence,

$$\cosh(a_2 + \varepsilon_0) > \cosh(a_4),$$

in contradiction with the choice of $\varepsilon_0$.

Thus, we are reduced to the case where $f$ is an element of $\mathrm{Aut}(\mathbb{P}^2_{\mathbb{C}})$.

Second step. Now we show that if $f$ is in $\mathrm{Aut}(\mathbb{P}^2_{\mathbb{C}})$ and $\mathrm{dist}(f_*[Q], [Q]) < \varepsilon_0$
for $[Q]$ in $\{g_*^{-1}[P], g_*[P]\}$, then $f$ is the identity map.

Applying the assumption to $[Q] = g_*^{-1}[P]$, we get

$$\mathrm{dist}((gfg^{-1})_*[P], [P]) < \varepsilon_0$$

and the first step shows that $gfg^{-1}$ must be linear. This implies that $f(\mathrm{Ind}(g))$
coincides with $\mathrm{Ind}(g)$. The same argument with $[Q] = g_*[P]$ gives $f(\mathrm{Ind}(g^{-1})) =
\mathrm{Ind}(g^{-1})$. If $\deg(g) \geq 3$, the set $\mathrm{Ind}(g)$ is a generic set of points in $\mathbb{P}^2_{\mathbb{C}}$ with
cardinal at least 5, so $f(\mathrm{Ind}(g)) = \mathrm{Ind}(g)$ implies that $f$ is the identity map (see
Remark 5.1). Finally, if $\deg(g) = 2$, the set $\mathrm{Ind}(g) \cup \mathrm{Ind}(g^{-1})$ is a generic set of
6 points in the plane, so if $f(\mathrm{Ind}(g)) = \mathrm{Ind}(g)$ and $f(\mathrm{Ind}(g^{-1})) = \mathrm{Ind}(g^{-1})$ then
again $f$ is the identity map. □

Corollary 5.8. If $g$ is a general element of $V_d$, then $\mathrm{Ax}(g_*)$ is rigid.

Proof. Suppose $\mathrm{Ax}(g_*)$ is not rigid. Choose $\eta > 0$. By Proposition 3.3 there
exists $f$ which does not preserve $\mathrm{Ax}(g_*)$ such that $\mathrm{dist}(f_*[Q], [Q]) < \eta$ for all $[Q]$ in
the segment $[g_*^{-1}[P], g_*[P]]$. For $\eta < \varepsilon_0$, this contradicts Proposition 5.7. □

Proof of Theorem 5.2. Let $g$ be a general element of $V_d$, with $d \geq 2$. Suppose
that $f_*(\mathrm{Ax}(g_*)) = \mathrm{Ax}(g_*)$. If $f_*$ preserves the orientation on $\mathrm{Ax}(g_*)$, then
$(fgf^{-1}g^{-1})_*$ fixes each point in $\mathrm{Ax}(g_*)$, and Proposition 5.7 gives $fgf^{-1} = g$.
Similarly, if $f_*$ reverses the orientation, considering $(fgf^{-1}g)_*$ we obtain $fgf^{-1} =
g^{-1}$.

Since we know by Corollary 5.8 that $\mathrm{Ax}(g_*)$ is rigid, we obtain that $g_*$ is
tight, hence by Theorem 2.9 $g^k$ generates a proper normal subgroup of $\mathrm{Bir}(\mathbb{P}^2_{\mathbb{C}})$ for
large enough $k$.

We can be more precise on $k$ by reconsidering the proof of Corollary 5.8. The
rigidity of the axis of $g_*$ follows from Proposition 5.7, where the condition on $\varepsilon_0$
is independent of $d$, so by Corollary 3.4 we obtain that $\mathrm{Ax}(g_*)$ is $(\varepsilon_0/2, 7L(g_*))$-rigid.
Recall that $\theta$ is defined in section 2.2.1 by $\theta = 4\delta$ and that $\delta = \log(3)$
works for the hyperbolic space $\mathbb{H}_{\bar{\mathcal{Z}}}$; thus we can choose $\theta = 4\log(3)$, so that
$14\theta = 56\log(3) = 61.52\ldots < 62$. By Lemma 3.2 we obtain that $\mathrm{Ax}(g_*)$ is also
$(14\theta, B)$-rigid, for

$$B \geq \max(1369, (28/3)L(g_*) + 124).$$

Then, Theorem 2.9 requires $kL(g_*) \geq 40B + 1200\theta$. Thus, we see that

$$k = \max\Big(\frac{60034}{L(g_*)},\ 374 + \frac{10234}{L(g_*)}\Big) \leq 86611$$

is sufficient to conclude that $g^k$ generates a proper normal subgroup of $\mathrm{Bir}(\mathbb{P}^2_{\mathbb{C}})$.
Asymptotically, for degrees $d$ with $d \geq \exp(10234)$, we can take $k = 375$. When
$d \geq 9$, $k = 30017$ works, etc. □
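The numerical constants of this estimate can be recomputed. The sketch below rests on one assumption that should be checked against Theorem 2.9: the requirement is read as $kL(g_*) \geq 40B + 1200\theta$ with $\theta = 4\log(3)$, a reading under which the constants 60034 and 10234 are recovered. It also uses $L(g_*) = \log(d)$ for a general $g \in V_d$. The names `THETA` and `k_bound` are hypothetical.

```python
import math

THETA = 4 * math.log(3)  # theta = 4 * delta, with delta = log(3)

def k_bound(d):
    """Smallest integer at least max(60034 / L, 374 + 10234 / L), L = log(d)."""
    L = math.log(d)
    return math.ceil(max(60034 / L, 374 + 10234 / L))

# Constants appearing in the bound, rounded up to integers.
FIRST = math.ceil(40 * 1369 + 1200 * THETA)   # from B >= 1369
SECOND = math.ceil(40 * 124 + 1200 * THETA)   # from B >= (28/3) L + 124
SLOPE = math.ceil(40 * 28 / 3)                # coefficient of L in 40 B
```

With these conventions `k_bound(2)` returns 86611, the uniform bound stated above, and `k_bound(9)` is smaller than 30017.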

5.2. Automorphisms of rational surfaces. 

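5.2.1. Preliminaries. Let $\mathbf{k}$ be an algebraically closed field, and $X$ be a rational
surface defined over $\mathbf{k}$. Let $g$ be an automorphism of $X$. If $\varphi\colon X \dashrightarrow \mathbb{P}^2_{\mathbf{k}}$ is
a birational map, conjugation by $\varphi$ provides an isomorphism between $\mathrm{Bir}(X)$
and $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{k}})$ and $\varphi_*$ conjugates the actions of $\mathrm{Bir}(X)$ on $\bar{\mathcal{Z}}(X)$ and $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{k}})$ on
$\bar{\mathcal{Z}} = \bar{\mathcal{Z}}(\mathbb{P}^2_{\mathbf{k}})$. We thus identify $\mathrm{Bir}(X)$ to $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{k}})$ and $\bar{\mathcal{Z}}(X)$ to $\bar{\mathcal{Z}}(\mathbb{P}^2_{\mathbf{k}})$ without
further reference to the choice of $\varphi$. This paragraph provides a simple criterion
to check whether $g$ is a tight element of the Cremona group $\mathrm{Bir}(X)$.

As explained in §4.6.3, the Néron-Severi group $N^1(X)$ embeds into the
Picard-Manin space $\bar{\mathcal{Z}}$ and $g_*$ preserves the orthogonal decomposition

$$\bar{\mathcal{Z}} = N^1(X)_{\mathbb{R}} \oplus N^1(X)_{\mathbb{R}}^{\perp},$$

where $N^1(X)_{\mathbb{R}}^{\perp}$ is the orthogonal complement with respect to the intersection
form. Assume that $g_*$ is hyperbolic, with axis $\mathrm{Ax}(g_*)$ and translation length
$\log(\lambda(g))$. By Lemma 4.7, the plane $V_g$ such that $\mathrm{Ax}(g_*) = \mathbb{H}_{\bar{\mathcal{Z}}} \cap V_g$ is contained
in $N^1(X)_{\mathbb{R}}$.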
Remark 5.9. Let $h$ be a birational transformation of $X$ such that $h_*$ preserves
$V_g$. The restriction of $h_*$ to $V_g$ satisfies one of the following two properties.

(1) $h_*$ and $g_*$ commute on $V_g$;

(2) $h_*$ is an involution of $V_g$ and $h_*$ conjugates $g_*$ to its inverse on $V_g$.

Indeed, the group of isometries of a hyperbolic quadratic form in two variables
is isomorphic to the semi-direct product $\mathbb{R} \rtimes \mathbb{Z}/2\mathbb{Z}$.
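The two cases of Remark 5.9 can be made explicit with $2 \times 2$ matrices. The sketch below is an illustration with hypothetical names: it uses isotropic coordinates on $V_g$ in which the quadratic form is $2aw$, so that $g_*$ acts as $\mathrm{diag}(1/\lambda, \lambda)$, and the sample values of $\lambda$ and $t$ are arbitrary.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two 2 x 2 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def preserves_form(A):
    """True when A preserves the form with Gram matrix [[0, 1], [1, 0]]."""
    J = [[0, 1], [1, 0]]
    At = [[A[j][i] for j in range(2)] for i in range(2)]
    return matmul(matmul(At, J), A) == J

lam = Fraction(3)                       # sample value for lambda(g)
g = [[1 / lam, 0], [0, lam]]            # g_* on V_g
g_inv = [[lam, 0], [0, 1 / lam]]

t = Fraction(5, 2)                      # arbitrary positive parameter
h_commuting = [[t, 0], [0, 1 / t]]      # case (1): same type as g_*
h_swap = [[0, t], [1 / t, 0]]           # case (2): exchanges the two isotropic lines
```

Both `h_commuting` and `h_swap` preserve the form. The first commutes with `g`, while the second is an involution and `matmul(matmul(h_swap, g), h_swap)` equals `g_inv`.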

Lemma 5.10. Let $g$ be a hyperbolic automorphism of a rational surface $X$.
Assume that

(i) $g_*$ is the identity on the orthogonal complement of $V_g$ in $N^1(X)_{\mathbb{R}}$;
(ii) the action of $\mathrm{Aut}(X)$ on $N^1(X)$ is faithful.

Let $h$ be an automorphism of $X$ such that $h_*$ preserves $V_g$. Then $hgh^{-1}$ is equal
to $g$ or $g^{-1}$.

Proof. Let us study the action of $h_*$ and $g_*$ on $N^1(X)$. Since $h_*$ preserves $V_g$, it
preserves its orthogonal complement $V_g^{\perp}$ and, by assumption (i), commutes with
$g_*$ on $V_g^{\perp}$. If $h_*$ commutes with $g_*$ on $V_g$, then $h_*$ and $g_*$ commute on $N^1(X)$, and
the conclusion follows from the second assumption. If $h_*$ does not commute with $g_*$,
Remark 5.9 implies that $h_*$ is an involution on $V_g$, and that $h_* g_* (h_*)^{-1} = (g_*)^{-1}$
on $V_g$ and therefore also on $N^1(X)$. Once again, assumption (ii) implies that $h$
conjugates $g$ to its inverse. □

Remark 5.11. The plane $V_g$ is a subspace of $N^1(X)_{\mathbb{R}}$, and it may very well
happen that this plane does not intersect the lattice $N^1(X)$ (except at the origin).
But, if $g_*$ is the identity on $V_g^{\perp}$, then $V_g^{\perp}$ and $V_g$ are defined over $\mathbb{Z}$. In that
case, $V_g \cap N^1(X)$ is a rank 2 lattice in the plane $V_g$. This lattice is $g_*$-invariant,
and the spectral values of the linear transformation $g_* \in \mathrm{GL}(V_g)$ are quadratic
integers. From this follows that $\lambda(g)$ is a quadratic integer. In the opposite
direction, if $\lambda(g)$ is a quadratic integer, then $V_g$ is defined over the integers, and
so is $V_g^{\perp}$. The restriction of $g_*$ to $V_g^{\perp}$ preserves the lattice $V_g^{\perp} \cap N^1(X)$ and a
negative definite quadratic form. This implies that a positive iterate of $g_*$ is the
identity on $V_g^{\perp}$.

Let us now assume that there exists an ample class $[D'] \in N^1(X) \cap V_g$. In that
case, $[D']$ and $g_*[D']$ generate a rank 2 subgroup of $N^1(X) \cap V_g$ and Remark 5.11
implies that $\lambda(g)$ is a quadratic integer. In what follows, we shall denote by $[D]$
the class

$$[D] = \frac{1}{\sqrt{[D'] \cdot [D']}} [D']. \qquad (5.2)$$

This is an ample class with real coefficients that determines a point $[D]$ in $\mathbb{H}_{\bar{\mathcal{Z}}}$.

Lemma 5.12. Let $h$ be a birational transformation of a projective surface $X$.
Let $[D'] \in N^1(X)$ be an ample class, and $[D] = [D'] / \sqrt{[D'] \cdot [D']} \in \mathbb{H}_{\bar{\mathcal{Z}}}$. If

$$\cosh(\mathrm{dist}(h_*[D], [D])) < 1 + ([D'] \cdot [D'])^{-1}$$

then $h_*$ fixes $[D']$ and $h$ is an automorphism of $X$.

Proof. Write $h_*[D] = [D] + [F] + [R]$ where $[F]$ is in $N^1(X)_{\mathbb{R}}$ and $[R]$ is in
$N^1(X)_{\mathbb{R}}^{\perp}$. More precisely, $[F]$ is an element of $N^1(X)$ divided by the square root
of the self intersection $[D']^2$, and $[R]$ is a sum $\sum m_i [E_i]$ of exceptional divisors
obtained by blowing up points of $X$, coming from indeterminacy points of $h^{-1}$;
the $m_i$ are integers divided by the square root of $[D']^2$. We get

$$1 \leq [D] \cdot h_*[D] = [D] \cdot ([D] + [F] + [R]) = 1 + [D] \cdot [F]$$

because $[D]$ does not intersect the $[E_i]$. The number $[D] \cdot [F]$ is a non negative
integer divided by $[D']^2$. By assumption, this number is less than $([D'] \cdot [D'])^{-1}$
and so it must be zero. In other words, the distance between $[D]$ and $h_*[D]$
vanishes, and $[D]$ is fixed. Since $[D]$ is ample, $h$ is an automorphism of $X$. □
Proposition 5.13. Let $g$ be a hyperbolic automorphism of a rational surface $X$.
Assume that

(i) $V_g$ contains an ample class $[D']$ and
(ii) $g_*$ is the identity on $V_g^{\perp} \cap N^1(X)$.

Then $\mathrm{Ax}(g_*)$ is rigid. Assume furthermore that

(iii) if $h \in \mathrm{Aut}(X)$ satisfies $h_*(\mathrm{Ax}(g_*)) = \mathrm{Ax}(g_*)$ then $hgh^{-1} = g$ or $g^{-1}$.

Then any $h \in \mathrm{Bir}(X)$ which preserves $\mathrm{Ax}(g_*)$ is an automorphism of $X$, and $g$ is
a tight element of $\mathrm{Bir}(X)$. Thus, for sufficiently large $k$ the iterate $g^k$ generates
a non trivial normal subgroup in the Cremona group $\mathrm{Bir}(\mathbb{P}^2_{\mathbf{k}}) = \mathrm{Bir}(X)$.

Note that if the action of $\mathrm{Aut}(X)$ on $N^1(X)$ is faithful then by Lemma 5.10
condition (iii) is automatically satisfied.

Proof. Let $[D] \in \mathrm{Ax}(g_*)$ be the ample class defined by equation (5.2). If the
axis of $g_*$ is not rigid, Proposition 3.3 provides a birational transformation $f$ of
$X$ such that the distances between $f_*[D]$ and $[D]$ and between $f_*(g_*[D])$ and
$g_*[D]$ are bounded by $([D']^2)^{-1}$, and, moreover, $f_*(\mathrm{Ax}(g_*)) \neq \mathrm{Ax}(g_*)$. Lemma
5.12 implies that $f$ is an automorphism of $X$ fixing both $[D]$ and $g_*[D]$. This
contradicts $f_*(\mathrm{Ax}(g_*)) \neq \mathrm{Ax}(g_*)$ and shows that $\mathrm{Ax}(g_*)$ is rigid.

Assume now that $h \in \mathrm{Bir}(X)$ preserves the axis of $g_*$. Then $h_*[D']$ is an ample
class, hence $h$ is an automorphism of $X$. Property (iii) implies that $g$ is a tight
element of $\mathrm{Bir}(X)$ and the conclusion follows from Theorem 2.9. □

In the following paragraphs we construct two families of examples which satisfy 
the assumption of Proposition 5.13. Note that the surfaces X that we shall 
consider have quotient singularities; if we blow-up X to get a smooth surface, 
then the class $[D']$ is big and nef but is no longer ample. For the first example,
we work over the field of complex numbers $\mathbb{C}$, while for the second we work over
any algebraically closed field.

5.2.2. Generalized Kummer surfaces. Consider Z[i] ⊂ C the lattice of Gaussian
integers, and let Y be the abelian surface C/Z[i] × C/Z[i]. The group GL_2(Z[i])
acts by linear transformations on C^2, preserving the lattice Z[i] × Z[i]; this
provides an embedding GL_2(Z[i]) → Aut(Y). Let X be the quotient of Y by the
action of the group of order 4 generated by η(x, y) = (ix, iy). This surface is
rational, with ten singularities, and all of them are quotient singularities that
can be resolved by a single blow-up. Such a surface is a so-called (generalized)
Kummer surface; classical Kummer surfaces are quotients of tori by (x, y) ↦
(−x, −y), and are not rational (these surfaces are examples of K3 surfaces).
The linear map η generates the center of the group GL_2(Z[i]). As a
consequence, GL_2(Z[i]), or more precisely PGL_2(Z[i]), acts by automorphisms on X.
Let M be an element of SL_2(Z) such that

(i) the trace tr(M) of M is at least 3;

(ii) M is in the level 2 congruence subgroup of SL_2(Z), i.e. M is equal to the
identity modulo 2.

Let ĝ be the automorphism of Y defined by M and g be the automorphism of
X induced by ĝ.

Theorem 5.14. The automorphism g: X → X satisfies properties (i), (ii), and
(iii) of Proposition 5.13. In particular, g determines a tight element of Bir(P^2_C);
hence, if k is large enough, g^k generates a non trivial normal subgroup of Bir(P^2_C).

Proof. The Néron-Severi group of Y has rank 4, and is generated by the following
classes: the class [A] of horizontal curves C/Z[i] × {*}, the class [B] of vertical
curves {*} × C/Z[i], the class [Δ] of the diagonal {(z, z) ∈ Y}, and the class [Δ_i]
of the graph {(z, iz) ∈ Y | z ∈ C/Z[i]}.

The vector space H^2(Y, R) is isomorphic to the space of bilinear alternating
two-forms on the 4-dimensional real vector space C^2. The action of ĝ* is given
by the action of M^{-1} on this space. The dynamical degree of ĝ is equal to the
square of the spectral radius of M, i.e. to the quadratic integer

\[ \lambda(\hat g) = \frac{1}{2}\left(a + \sqrt{a^2 - 4}\right) \]

where a = tr(M)^2 − 2 > 2. Thus, the plane V_ĝ intersects N^1(Y) on a lattice. Let
[F] be an element of V_ĝ ∩ N^1(Y) with [F]^2 > 0. Since Y is an abelian variety,
[F] (or its opposite) is ample. Since M has integer coefficients, the linear
map ĝ* preserves the three dimensional subspace W of N^1(Y) generated by [A],
[B] and [Δ]. The orthogonal complement of V_ĝ intersects W on a line, on which
ĝ* must be the identity, because det(M) = 1. The orthogonal complement of W
is also a line, so that ĝ* is the identity on V_ĝ^⊥ ⊂ N^1(Y).
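As a numerical sanity check of the formula for the dynamical degree, the following minimal sketch compares the square of the spectral radius of a sample matrix with (a + √(a^2 − 4))/2, where a = tr(M)^2 − 2. The matrix M = [[1, 2], [2, 5]] and all function names are our own illustrative choices; M lies in SL_2(Z), has trace 6, and is the identity modulo 2.

```python
import math

def spectral_radius_sl2(m):
    """Largest eigenvalue modulus of a 2x2 integer matrix with det 1 and |trace| > 2."""
    (p, q), (r, s) = m
    assert p * s - q * r == 1, "expects a matrix of determinant 1"
    t = abs(p + s)
    assert t > 2, "expects a hyperbolic matrix"
    # The eigenvalues are the roots of X^2 - tX + 1 (up to sign).
    return (t + math.sqrt(t * t - 4)) / 2

def dynamical_degree(m):
    """The value (a + sqrt(a^2 - 4)) / 2 with a = tr(M)^2 - 2."""
    (p, _), (_, s) = m
    a = (p + s) ** 2 - 2
    return (a + math.sqrt(a * a - 4)) / 2

def is_identity_mod_2(m):
    """Membership test for the level 2 congruence subgroup (entries only)."""
    (p, q), (r, s) = m
    return p % 2 == 1 and s % 2 == 1 and q % 2 == 0 and r % 2 == 0

# Hypothetical example matrix: det 1, trace 6, identity modulo 2.
M = [[1, 2], [2, 5]]
```

For this sample matrix both quantities equal 17 + 12√2, up to floating point error.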

Transporting this picture in N^1(X), we obtain: the dynamical degree of g
is equal to the dynamical degree of ĝ (see [27], for more general results), the
plane V_ĝ surjects onto V_g, the image of [F] is an ample class [D'] contained in
V_g ∩ N^1(X), and g* is the identity on V_g^⊥.

Automorphisms of X permute the ten singularities of X. The fundamental
group Γ of X \ Sing(X) is the affine group Z/4Z ⋉ (Z[i] × Z[i]) where Z/4Z is
generated by η. The abelian group Z[i] × Z[i] is the unique maximal free abelian
subgroup of rank 4 in Γ and, as such, is invariant under all automorphisms of Γ.
This implies that all automorphisms of X lift to (affine) automorphisms of Y.
Let h be an automorphism of X which preserves the axis Ax(g*). Then h*
conjugates g* to g* or (g*)^{-1} (Remark 5.9), and we must show that h conjugates
g to g or g^{-1}. Let ĥ be a lift of h to Y. There exists a linear transformation
N ∈ GL_2(C) and a point (a, b) ∈ Y such that

\[ \hat h(x, y) = N(x, y) + (a, b). \]

The lattice Z[i] × Z[i] is N-invariant, and N conjugates M to M or its inverse
M^{-1}, because ĥ* conjugates ĝ* to ĝ* or its inverse. Then

\[ \hat h \circ \hat g \circ \hat h^{-1}(x, y) = M^{\pm 1}(x, y) + (\mathrm{Id} - M^{\pm 1})(a, b). \]

On the other hand, since ĥ is a lift of an automorphism of X, the translation
t : (x, y) ↦ (x, y) + (a, b) is normalized by the cyclic group generated by η. Thus a
and b are in (1/2)Z[i]. Since M is the identity modulo 2, we have M(a, b) = (a, b)
modulo Z[i] × Z[i]. Hence ĥ ∘ ĝ ∘ ĥ^{-1} = ĝ^{±1} and, coming back to X, h conjugates
g to g or its inverse. □
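The last step of the proof uses that a matrix which is the identity modulo 2 fixes every point of (1/2)Z[i] × (1/2)Z[i] modulo the lattice Z[i] × Z[i]. The sketch below checks this on the 16 classes of half-lattice points for a given integer matrix. Gaussian numbers are represented as (real, imaginary) pairs of fractions; this representation and the function names are our own assumptions, not notation from the text.

```python
from fractions import Fraction
from itertools import product

HALF = Fraction(1, 2)

def is_gaussian_integer(z):
    """z is a pair (re, im) of Fractions; test whether both parts are integers."""
    re, im = z
    return re.denominator == 1 and im.denominator == 1

def apply_int_matrix(m, a, b):
    """Apply a 2x2 integer matrix to the vector (a, b) of Gaussian numbers."""
    (p, q), (r, s) = m
    first = (p * a[0] + q * b[0], p * a[1] + q * b[1])
    second = (r * a[0] + s * b[0], r * a[1] + s * b[1])
    return first, second

def fixes_half_lattice_points(m):
    """True if M(a, b) - (a, b) lies in Z[i] x Z[i] for all a, b in (1/2)Z[i]."""
    # Representatives of (1/2)Z[i] modulo Z[i]: 0, 1/2, i/2, (1 + i)/2.
    halves = [(x * HALF, y * HALF) for x, y in product((0, 1), repeat=2)]
    for a, b in product(halves, repeat=2):
        ma, mb = apply_int_matrix(m, a, b)
        diff_a = (ma[0] - a[0], ma[1] - a[1])
        diff_b = (mb[0] - b[0], mb[1] - b[1])
        if not (is_gaussian_integer(diff_a) and is_gaussian_integer(diff_b)):
            return False
    return True
```

For instance, the check succeeds for [[1, 2], [2, 5]], which is the identity modulo 2, and fails for [[2, 1], [1, 1]], which is not.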

Remark 5.15. The lattice of Gaussian integers can be replaced by the lattice
of Eisenstein integers Z[j] ⊂ C, with j^3 = 1, j ≠ 1, and the homothety η by
η(x, y) = (jx, jy). This leads to a second rational Kummer surface with an action
of the group PSL_2(Z), and a statement similar to Theorem B can be proved for
this example.

5.2.3. Coble surfaces. Let k be an algebraically closed field. Let S ⊂ P^2_k be a
rational sextic curve, with ten double points m_i, 1 ≤ i ≤ 10; such sextic curves
exist and, modulo the action of Aut(P^2_k), they depend on 9 parameters (see [28],
the appendix of [26], or [10]). Let X be the surface obtained by blowing up the
ten double points of S: by definition X is the Coble surface defined by S.

Let π: X → P^2_k be the natural projection and E_i, 1 ≤ i ≤ 10, be the
exceptional divisors of π. The canonical class of X is

\[ [K_X] = -3[H] + \sum_{i=1}^{10} [E_i] \]

where [H] is the pullback by π of the class of a line. The strict transform S' of
S is an irreducible divisor of X, and its class [S'] coincides with −2[K_X]; more
precisely, there is a rational section Ω of 2K_X that does not vanish and has
simple poles along S'.
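The numerical identities behind this paragraph can be checked in the standard basis [H], [E_1], ..., [E_10] of N^1(X), where [H]^2 = 1, [E_i]^2 = −1 and all other products vanish. The following sketch (with names of our own choosing) verifies that the class 6[H] − 2Σ[E_i] of the strict transform equals −2[K_X], and computes its self-intersection and arithmetic genus via the adjunction formula.

```python
def intersect(u, v):
    """Intersection form on N^1(X) in the basis [H], [E_1], ..., [E_10]."""
    return u[0] * v[0] - sum(x * y for x, y in zip(u[1:], v[1:]))

# [K_X] = -3[H] + [E_1] + ... + [E_10]
K_X = [-3] + [1] * 10

# Strict transform of a sextic with ten double points: 6[H] - 2([E_1] + ... + [E_10])
S_PRIME = [6] + [-2] * 10

def arithmetic_genus(curve):
    """Adjunction formula: p_a = (C^2 + K.C)/2 + 1."""
    return (intersect(curve, curve) + intersect(K_X, curve)) // 2 + 1
```

With these conventions one finds [K_X]^2 = −1, [S']^2 = −4 and arithmetic genus 0 for S', in agreement with S being a rational curve.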

Remark 5.16. Another definition of Coble surfaces requires X to be a smooth
rational surface with a non zero regular section of −2K_X. Such a definition
includes our Coble surfaces (the section vanishes along S') but it also includes
the Kummer surfaces from the previous paragraph (see [21]). Our definition is
more restrictive.
The self-intersection of S' is −4, and S' can be blown down: this provides
a birational morphism q: X → X_0; the surface X_0 has a unique singularity, at
m = q(S'). The section Ω defines a holomorphic section of 2K_{X_0} that trivializes
2K_{X_0} in the complement of m; in particular, H^0(X, −2K_X) has dimension 1, and
the base locus of −2K_X coincides with S'. The automorphism group Aut(X) acts
linearly on the space of sections of −2K_X, and preserves its base locus S'. It
follows that q conjugates Aut(X) and Aut(X_0).

The rank of the Néron-Severi group N^1(X) is equal to 11. Let W be the
orthogonal complement of [K_X] with respect to the intersection form. The linear
map q*: N^1(X_0) → N^1(X) provides an isomorphism between N^1(X_0) and its
image W = [K_X]^⊥ ⊂ N^1(X). Let O'(N^1(X)) be the group of isometries of the
lattice N^1(X) which preserve the canonical class [K_X].

We shall say that S (resp. X) is special when at least one of the following
properties occurs (see [18], [10] page 147):

(1) three of the points m_i are collinear;

(2) six of the points m_i lie on a conic;

(3) eight of the m_i lie on a cubic curve with a double point at one of them;

(4) the points m_i lie on a quartic curve with a triple point at one of them.

Theorem 5.17 (Coble's theorem, see [19, 20]). Let k be an algebraically closed
field. There is a non empty Zariski open subset U of the space of rational sextic
curves S ⊂ P^2_k with the following property: if S is a point of U and X is the
associated Coble surface then the morphism

\[ \mathrm{Aut}(X) \to O'(N^1(X)), \qquad f \mapsto f^* \]

is injective and its image is the level 2 congruence subgroup of O'(N^1(X)). When
the characteristic of k is different from 2, the set U coincides with the set of sextic
curves with ten double points which are not special.^5

Remark 5.18. Let Amp(W) be the set of ample classes in N^1(X_0) ⊗ R ≅ W ⊗ R.
This convex cone is invariant under the action of Aut(X), hence under the action
of a finite index subgroup of the orthogonal group O(W). As such, Amp(W) is
equal to {[D] ∈ W | [D]^2 > 0, [D]·[H] > 0}.



^5 The proof is similar to an analogous result concerning generic Enriques surfaces, and follows
from [17], [10], and [19]. The automorphism group of X can be identified with a normal subgroup
of O'(N^1(X)) which is generated, as a normal subgroup, by an explicit involution. The main
point is to realize this involution by an automorphism; to do so, one constructs a 2 to 1 morphism
from X to a Del Pezzo surface and takes the involution of this cover. For this purpose, one has
to exclude all special sextics and, in characteristic 2, one needs to check that the cover is not
purely inseparable (this adds one extra condition).
Let S be a generic rational sextic, and X be its associated Coble surface.
Theorem 5.17 gives a recipe to construct automorphisms of X: let ψ be an
isometry of the lattice W; if ψ is equal to the identity modulo 2, and ψ[H]·[H] >
0, then ψ = g* for a unique automorphism g of X. Let us apply this idea to cook
up a tight automorphism of X.

Lemma 5.19 (Pell-Fermat equation). Let Q(u, v) = au^2 + buv + cv^2 be a
binary quadratic form with integer coefficients. Assume that Q is non degenerate,
indefinite, and does not represent 0. Then there exists an isometry φ in the
orthogonal group O_Q(Z) with eigenvalues λ(Q) > 1 > 1/λ(Q) > 0.

Sketch of the proof. Isometries in O_Q(Z) correspond to units in the quadratic
field defined by the polynomial Q(t, 1) ∈ Z[t]; finding units, or isometries, is
a special case of Dirichlet's unit theorem and amounts to solving a Pell-Fermat
equation (see [23], chapter V.1). □
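To make the Pell-Fermat equation concrete, here is a minimal brute-force sketch that finds the smallest positive solution of x^2 − n y^2 = 1 for a non-square integer n. The function name and the search bound `max_y` are our own assumptions; the search is only meant for small n and is not an efficient algorithm (continued fractions would be the usual tool).

```python
import math

def fundamental_pell_solution(n, max_y=10**6):
    """Smallest (x, y) with y >= 1 and x^2 - n*y^2 = 1, found by brute force."""
    if n <= 0 or math.isqrt(n) ** 2 == n:
        raise ValueError("n must be a positive integer that is not a perfect square")
    for y in range(1, max_y + 1):
        x_squared = n * y * y + 1
        x = math.isqrt(x_squared)
        if x * x == x_squared:
            return x, y
    raise ValueError("no solution found below max_y")
```

For n = 3 the search returns (2, 1), which corresponds to the unit 2 + √3 of norm 1.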

Let [D_1] and [D_2] be the elements of N^1(X) defined by

\[ [D_1] = 6[H] - [E_9] - [E_{10}] - \sum_{i=1}^{8} 2[E_i], \]

\[ [D_2] = 6[H] - [E_7] - [E_8] - \sum_{i \neq 7,8} 2[E_i]. \]

Both of them have self-intersection 2, are contained in W, and intersect [H]
positively; as explained in Remark 5.18, [D_1] and [D_2] are ample classes of X_0.
Let V be the plane containing [D_1] and [D_2], and Q be the restriction of the
intersection form to V. If u and v are integers, then

\[ Q(u[D_1] + v[D_2]) = (u[D_1] + v[D_2]) \cdot (u[D_1] + v[D_2]) = 2u^2 + 8uv + 2v^2 \]

because [D_1]·[D_2] = 4. This quadratic form does not represent 0, because its
discriminant is not a perfect square. From Lemma 5.19, there is an isometry φ
of V with an eigenvalue λ(φ) > 1. An explicit computation shows that the group
of isometries of Q is the semi-direct product of a cyclic group Z, generated by
φ, with

\[ \varphi([D_1]) = 4[D_1] - [D_2], \qquad \varphi([D_2]) = [D_1], \]

and the group Z/2Z generated by the involution which permutes [D_1] and [D_2].
The second iterate of φ is equal to the identity modulo 2. Coble's theorem
(Theorem 5.17) now implies that there exists a unique automorphism g of X
such that

(1) g* coincides with φ^2 on V;

(2) g* is the identity on the orthogonal complement V^⊥.
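The lattice computations of this construction can be checked mechanically. The sketch below works in the basis [H], [E_1], ..., [E_10] with [H]^2 = 1 and [E_i]^2 = −1. It computes the Gram matrix of ([D_1], [D_2]), tests that both classes are orthogonal to [K_X], and verifies that the map with φ([D_1]) = 4[D_1] − [D_2], φ([D_2]) = [D_1] preserves this Gram matrix and has square congruent to the identity modulo 2. The matrix encoding of φ (columns are images of basis vectors) and all names are our own conventions.

```python
import math

def intersect(u, v):
    """Intersection form in the basis [H], [E_1], ..., [E_10] of N^1(X)."""
    return u[0] * v[0] - sum(x * y for x, y in zip(u[1:], v[1:]))

K_X = [-3] + [1] * 10
D1 = [6] + [-2] * 8 + [-1, -1]             # 6H - 2(E_1 + ... + E_8) - E_9 - E_10
D2 = [6] + [-2] * 6 + [-1, -1] + [-2, -2]  # 6H - E_7 - E_8 - 2(all other E_i)

GRAM = [[intersect(u, v) for v in (D1, D2)] for u in (D1, D2)]

# Columns are the images of [D_1] and [D_2]: phi(D1) = 4 D1 - D2, phi(D2) = D1.
PHI = [[4, 1], [-1, 0]]

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(a):
    return [[a[j][i] for j in range(2)] for i in range(2)]

def is_isometry(phi, gram):
    """phi preserves the bilinear form iff phi^T * gram * phi == gram."""
    return mat_mul(mat_mul(transpose(phi), gram), phi) == gram

PHI2 = mat_mul(PHI, PHI)
TRACE2 = PHI2[0][0] + PHI2[1][1]
# Largest eigenvalue of phi^2, a matrix of determinant 1 and trace TRACE2.
DYNAMICAL_DEGREE = (TRACE2 + math.sqrt(TRACE2 ** 2 - 4)) / 2
```

Here the Gram matrix is [[2, 4], [4, 2]], the trace of φ^2 is 14, and the largest eigenvalue of φ^2 agrees numerically with 7 + 4√3.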
The dynamical degree of g is the square of λ(φ), and is equal to 7 + 4√3.
Properties (i), (ii), and (iii) of Proposition 5.13 are satisfied. Thus, large powers of
g generate non trivial normal subgroups of the Cremona group:

Theorem 5.20. Let k be an algebraically closed field. Let X be a generic Coble
surface defined over k. There are hyperbolic automorphisms of X that generate
non trivial normal subgroups of the Cremona group Bir(X) = Bir(P^2_k).

As a corollary, the Cremona group Bir(P^2_k) is not simple if k is algebraically
closed, as announced in the Introduction.

6. Complements 

6.1. Polynomial automorphisms and monomial transformations. The group of
polynomial automorphisms of the affine plane, and the group of monomial
transformations of P^2_C, were both sources of inspiration for the results in this
paper. We now use these groups to construct hyperbolic elements g of Bir(P^2_C)
for which ⟨⟨g⟩⟩ coincides with Bir(P^2_C).

6.1.1. Monomial transformations. Consider the group of monomial
transformations of P^2_C. By definition, this group is isomorphic to GL_2(Z), acting by

\[ N = \begin{pmatrix} a & b \\ c & d \end{pmatrix} : (x, y) \mapsto (x^a y^b, x^c y^d) \]

in affine coordinates (x, y). The matrix −Id corresponds to the standard
quadratic involution σ(x, y) = (1/x, 1/y).
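The identification with GL_2(Z) means that composing monomial maps corresponds to multiplying matrices. The following sketch checks this at a rational sample point, and checks that −Id acts as (x, y) ↦ (1/x, 1/y). The helper names are hypothetical, and exact rational arithmetic is used so that the comparisons are exact.

```python
from fractions import Fraction

def monomial_map(matrix):
    """Return the map (x, y) -> (x^a y^b, x^c y^d) attached to [[a, b], [c, d]]."""
    (a, b), (c, d) = matrix

    def apply(point):
        x, y = point
        return (x ** a * y ** b, x ** c * y ** d)

    return apply

def mat_mul(m1, m2):
    return [[sum(m1[i][k] * m2[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def composition_matches_product(m1, m2, point):
    """Check that (map of m1) o (map of m2) equals the map of the product m1 * m2."""
    composed = monomial_map(m1)(monomial_map(m2)(point))
    return composed == monomial_map(mat_mul(m1, m2))(point)
```

The sample point must have non zero coordinates, since negative exponents occur.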

If one considers PSL_2(Z) as a subgroup of PSL_2(R) ≅ Isom(H^2), it is an
interesting exercise to check that all hyperbolic matrices of PSL_2(Z) are tight elements
of PSL_2(Z). However, when we see GL_2(Z) as a subgroup of the Cremona group,
we obtain the following striking remark.

Proposition 6.1. The normal subgroup of Bir(P^2_C) generated by any monomial
transformation is never proper: if g is not the identity, then ⟨⟨g⟩⟩ = Bir(P^2_C).

Remark 6.2 (Gizatullin and Noether). If N is a normal subgroup of Bir(P^2_k)
containing a non trivial automorphism of P^2_k, then N coincides with Bir(P^2_k).
The proof is as follows. Since Aut(P^2_k) is the simple group PGL_3(k) and N is
normal, N contains Aut(P^2_k). In particular, N contains the automorphism h
defined by

\[ h(x, y) = (1 - x, 1 - y) \]

in affine coordinates. An easy calculation shows that the standard quadratic
involution σ satisfies σ = (hσ)h(hσ)^{-1}; hence, σ is conjugate to h, and σ is
contained in N. The conclusion follows from Noether's theorem, which states
that σ and Aut(P^2_k) generate Bir(P^2_k) (see [32], §2.5).
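The "easy calculation" can be checked pointwise with exact rational arithmetic. The sketch below takes h(x, y) = (1 − x, 1 − y) and the standard involution (x, y) ↦ (1/x, 1/y), and compares (hσ)h(hσ)^{-1} with σ at a sample point. Since h and σ are involutions, (hσ)^{-1} = σh. The function names are ours, and the point must avoid coordinates 0 and 1.

```python
from fractions import Fraction

def sigma(point):
    """Standard quadratic involution in affine coordinates."""
    x, y = point
    return (1 / x, 1 / y)

def h(point):
    """The affine involution (x, y) -> (1 - x, 1 - y)."""
    x, y = point
    return (1 - x, 1 - y)

def h_sigma(point):
    return h(sigma(point))

def h_sigma_inverse(point):
    # h and sigma are involutions, so (h o sigma)^{-1} = sigma o h.
    return sigma(h(point))

def conjugate_of_h(point):
    """Evaluate (h sigma) h (h sigma)^{-1} at the given point."""
    return h_sigma(h(h_sigma_inverse(point)))
```

Equivalently, the identity says that hσ has order 3, which can be tested by applying `h_sigma` three times.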
Proof of Proposition 6.1. Let g be any non trivial monomial map in Bir(P^2_C),
given by the matrix with rows (a, b) and (c, d). The commutator of g with the
diagonal map f(x, y) = (αx, βy) is the diagonal map

\[ g^{-1} f^{-1} g f : (x, y) \mapsto (\alpha^{1-d} \beta^{b} x, \; \alpha^{c} \beta^{1-a} y). \tag{6.1} \]

Thus, the normal subgroup ⟨⟨g⟩⟩ contains an element of Aut(P^2_C) \ {Id} and
Remark 6.2 concludes the proof. □
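Formula (6.1) can be verified numerically. The sketch below composes the four maps f, g, f^{-1}, g^{-1} in that order of application and compares the result with the closed formula. It assumes that the matrix of g has determinant 1, so that g^{-1} is given by the matrix with rows (d, −b) and (−c, a); this restriction, like the function names, is our own assumption for the sake of the example.

```python
from fractions import Fraction

def monomial(matrix, point):
    """Apply (x, y) -> (x^a y^b, x^c y^d) for matrix [[a, b], [c, d]]."""
    (a, b), (c, d) = matrix
    x, y = point
    return (x ** a * y ** b, x ** c * y ** d)

def commutator(matrix, alpha, beta, point):
    """Evaluate g^{-1} f^{-1} g f at a point, for det(matrix) = 1."""
    (a, b), (c, d) = matrix
    assert a * d - b * c == 1, "this sketch assumes determinant 1"
    inverse = [[d, -b], [-c, a]]
    x, y = point
    q = (alpha * x, beta * y)            # apply f
    q = monomial(matrix, q)              # apply g
    q = (q[0] / alpha, q[1] / beta)      # apply f^{-1}
    return monomial(inverse, q)          # apply g^{-1}

def formula_6_1(matrix, alpha, beta, point):
    """Right-hand side of (6.1)."""
    (a, b), (c, d) = matrix
    x, y = point
    return (alpha ** (1 - d) * beta ** b * x, alpha ** c * beta ** (1 - a) * y)
```

For a non trivial matrix and generic α, β the resulting diagonal map is not the identity, which is what the proof uses.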

6.1.2. Polynomial automorphisms. As mentioned in the Introduction, Danilov
proved that the group Aut[A^2_k]_1 of polynomial automorphisms of the affine plane
A^2_k with Jacobian determinant one is not simple. Danilov's proof uses an action
on a tree. Since Aut[A^2_k]_1 is a subgroup of Bir(P^2_k), we also have the action of
Aut[A^2_k]_1 on the hyperbolic space H_Z̄. It is a nice observation that g ∈ Aut[A^2_k]_1
determines a hyperbolic isometry of the tree if and only if it determines a
hyperbolic isometry of H_Z̄: hyperbolicity corresponds to an exponential growth of
the sequence of degrees deg(g^n).

Fix a number a ∈ k* and a polynomial p ∈ k[y] of degree d ≥ 2, and consider
the automorphism of A^2_k defined by h(x, y) = (y, p(y) − ax). This automorphism
determines an algebraically stable birational transformation of P^2_k, namely

\[ h[x : y : z] = [y z^{d-1} : P(y, z) - a x z^{d-1} : z^d], \tag{6.2} \]

where P(y, z) = p(y/z) z^d. There is a unique indeterminacy point Ind(h) =
{[1 : 0 : 0]}, and a unique indeterminacy point for the inverse, Ind(h^{-1}) = {[0 :
1 : 0]}. This Cremona transformation is hyperbolic, with translation length
L(h*) = log(d); in particular, the translation length goes to infinity with d.
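The exponential growth of degrees can be observed directly on small examples. The following sketch iterates h(x, y) = (y, p(y) − ax) symbolically, storing a polynomial in x and y as a dictionary from exponent pairs to coefficients, and records the degree of each iterate. The polynomial representation and every function name are our own; coefficients are integers or fractions, so the check is carried out in characteristic 0 only.

```python
def poly_add(p, q):
    result = dict(p)
    for key, coeff in q.items():
        result[key] = result.get(key, 0) + coeff
    return {key: coeff for key, coeff in result.items() if coeff != 0}

def poly_mul(p, q):
    result = {}
    for (i1, j1), c1 in p.items():
        for (i2, j2), c2 in q.items():
            key = (i1 + i2, j1 + j2)
            result[key] = result.get(key, 0) + c1 * c2
    return {key: coeff for key, coeff in result.items() if coeff != 0}

def poly_scale(p, scalar):
    return {key: scalar * coeff for key, coeff in p.items() if scalar * coeff != 0}

def substitute(coeffs, q):
    """Evaluate p(t) = sum(coeffs[k] * t^k) at the polynomial q (Horner scheme)."""
    result = {}
    for coeff in reversed(coeffs):
        result = poly_mul(result, q)
        if coeff != 0:
            result = poly_add(result, {(0, 0): coeff})
    return result

def degree(p):
    return max(i + j for (i, j) in p) if p else -1

def degrees_of_iterates(coeffs, a, n):
    """Degrees of h, h^2, ..., h^n for h(x, y) = (y, p(y) - a*x)."""
    pair = ({(1, 0): 1}, {(0, 1): 1})  # the identity map (x, y)
    degrees = []
    for _ in range(n):
        first, second = pair
        pair = (second, poly_add(substitute(coeffs, second), poly_scale(first, -a)))
        degrees.append(max(degree(pair[0]), degree(pair[1])))
    return degrees
```

For p(y) = y^2 and a = 1 the degrees of the first four iterates are 2, 4, 8, 16, consistent with deg(h^n) = d^n and with a translation length equal to log(d).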

Proposition 6.3. For all integers d ≥ 2, equation (6.2) defines a subset H_d ⊂ V_d
which depends on d + 2 parameters and satisfies: for all h in H_d, h is a hyperbolic,
algebraically stable Cremona transformation, but the normal subgroup generated
by h coincides with Bir(P^2_k).

Proof. The automorphism h is the composition of the de Jonquières
transformation (x, y) ↦ (p(y) − ax, y) and the linear map (x, y) ↦ (y, x). As such, h is an
element of V_d. If f denotes the automorphism f(x, y) = (x, y + 1), then

\[ (h^{-1} \circ f \circ h)(x, y) = (x - a^{-1}, y) \]

is linear (thus, the second step in the proof of Proposition 5.7 does not work for
h). As a consequence, the commutator f^{-1}h^{-1}fh is linear and ⟨⟨h⟩⟩ intersects
Aut(P^2_k) non trivially. The conclusion follows from Remark 6.2. □
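The conjugation identity used in this proof is easy to test with exact arithmetic. The sketch below builds h(x, y) = (y, p(y) − ax) and its inverse (u, v) ↦ ((p(u) − v)/a, u) for a polynomial given by its list of coefficients, then evaluates h^{-1} ∘ f ∘ h and the commutator f^{-1}h^{-1}fh at a sample point. The factory function `make_henon` and the coefficient encoding are illustrative assumptions.

```python
from fractions import Fraction

def make_henon(coeffs, a):
    """Return (h, h_inverse) for h(x, y) = (y, p(y) - a*x), p(t) = sum(coeffs[k] * t^k)."""

    def p(t):
        return sum(c * t ** k for k, c in enumerate(coeffs))

    def h(point):
        x, y = point
        return (y, p(y) - a * x)

    def h_inverse(point):
        u, v = point
        return ((p(u) - v) / a, u)

    return h, h_inverse

def f(point):
    x, y = point
    return (x, y + 1)

def f_inverse(point):
    x, y = point
    return (x, y - 1)

def conjugate(h, h_inverse, point):
    """Evaluate h^{-1} o f o h."""
    return h_inverse(f(h(point)))

def commutator(h, h_inverse, point):
    """Evaluate f^{-1} o h^{-1} o f o h."""
    return f_inverse(conjugate(h, h_inverse, point))
```

With a given as a `Fraction`, the conjugate is the translation (x, y) ↦ (x − 1/a, y) and the commutator is (x, y) ↦ (x − 1/a, y − 1), whatever the polynomial p.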

Note that for h in H_d and large integers n, we expect ⟨⟨h^n⟩⟩ to be a proper
normal subgroup of the Cremona group:

Question 6.4. Let k be any field. Consider the polynomial automorphism

\[ g : (x, y) \mapsto (y, y^2 + x). \]

Does there exist an integer n > 0 (independent of k) such that ⟨⟨g^n⟩⟩ is a proper
normal subgroup of Bir(P^2_k)?

The main point would be to adapt Step 2 in the proof of Proposition 5.7. 

6.2. Projective surfaces. The reason why we focused on the group Bir(P^2_C)
comes from the fact that Bir(X) is small compared to Bir(P^2_C) when X is an
irrational complex projective surface. The proof of the following proposition
illustrates this property.

Proposition 6.5. Let X be a complex projective surface. If the group Bir(X) is
infinite and simple, then X is birationally equivalent to C × P^1(C) where C is a
curve with trivial automorphism group.

Sketch of the proof. Assume, first, that the Kodaira dimension of X is non
negative. Replace X by its unique minimal model, and identify Bir(X) with Aut(X).
The group Aut(X) acts on the homology of X, and the kernel is equal to its
connected component Aut(X)^0 up to a finite index group. The action on the
homology group provides a morphism to GL_n(Z) for some n ≥ 1. Reducing modulo
p for large primes p, one sees that GL_n(Z) is residually finite. Since Aut(X) is
assumed to be simple, this implies that Aut(X) coincides with Aut(X)^0. But
Aut(X)^0 is abelian for surfaces with non negative Kodaira dimension (see [1]).
Thus Bir(X) = Aut(X) is not both infinite and simple when the Kodaira
dimension of X is ≥ 0. Assume now that X is ruled and not rational. Up to a
birational change of coordinates, X is a product P^1_C × C where C is a smooth
curve of genus g(C) ≥ 1. The group Bir(X) projects surjectively onto Aut(C).
By simplicity, Aut(C) must be trivial. In that case, Bir(X) coincides with the
infinite simple group PGL_2(M(C)) where M(C) is the field of meromorphic
functions of C. The remaining case is when X is rational, and Theorem A concludes
the proof. □
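The residual finiteness of GL_n(Z) used in this sketch rests on an elementary observation: if A is an integer matrix different from the identity, then A stays different from the identity modulo any prime p larger than the absolute values of the entries of A − Id. The following minimal illustration, with function names of our own, returns such a prime.

```python
def is_prime(n):
    """Trial division; adequate for the small integers used here."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True

def is_identity_mod(matrix, p):
    """True if the integer matrix reduces to the identity modulo p."""
    n = len(matrix)
    return all((matrix[i][j] - (1 if i == j else 0)) % p == 0
               for i in range(n) for j in range(n))

def separating_prime(matrix):
    """A prime p such that the matrix is not the identity modulo p."""
    n = len(matrix)
    bound = max(abs(matrix[i][j] - (1 if i == j else 0))
                for i in range(n) for j in range(n))
    if bound == 0:
        raise ValueError("the identity matrix is trivial modulo every prime")
    p = bound + 1
    while not is_prime(p):
        p += 1
    return p
```

For example, the matrix [[1, 6], [0, 1]] is the identity modulo 2 and modulo 3, but not modulo 7.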

6.3. SQ-universality and the number of quotients. A group is said to be
SQ-universal (or SubQuotient-universal) if any countable group can be
embedded into one of its quotients. For example, the pioneering work [30] proves
that the free group on two generators is SQ-universal. If G is a non
elementary hyperbolic group, then G is SQ-universal. This result has been obtained
by Delzant and Olshanskii in [13] and [38]. It seems reasonable to expect that
the Cremona group Bir(P^2_C) is also SQ-universal, with a proof similar to Delzant
and Olshanskii's proofs, but for Bir(P^2_C) acting on H_Z̄ instead of a hyperbolic
group acting on itself. Dahmani and Guirardel [11] have announced a general
result which, thanks to what is proved in our paper, should easily imply the
SQ-universality of the Cremona group. We thus refer to a forthcoming paper by
Dahmani and Guirardel, together with Osin, for SQ-universality. Note that
SQ-universality implies the existence of an uncountable number of non isomorphic
quotients. The following weaker result is a direct consequence of Theorem A.

Proposition 6.6. The Cremona group Bir(P^2_C) has an uncountable number of
distinct normal subgroups.